Cytoplasmic Modulators of Integrin Binding/Signalling

ABSTRACT

The present invention relates to purified and isolated polynucleotides encoding a polypeptide which specifically bind to a cytoplasmic portion of an integrin. Specifically, the invention provides a FLP-1-encoding polynucleotide and the polypeptide product of the gene. Expression vectors comprising the polynucleotide, antibodies which recognize the polypeptide, hybridomas which secrete the antibodies, and method to identify modulators of interaction of the polypeptide with  beta 7 subunits sequences are also provided.

BACKGROUND

A significant characteristic of the immune and inflammatory responses is the movement of leukocytes from the bloodstream into specific tissues in response to various physiological signals. For example, certain subsets of lymphocytes "home" to various secondary lymphoid tissues such as lymph nodes or Peyer's patches, and eventually return to circulation. Other leukocytes such as granulocytes and monocytes, however, do not return to circulation after transmigration from the bloodstream. Movement of leukocytes from circulation is effected by a series of receptor/counter-receptor interactions which are coordinated by various specific membrane adhesion molecules.

Extravasation of leukocytes from the bloodstream for review, see McEver, Curr. Opin. Cell Biol. 4:840-849 (1992)! is initially effected by a family of membrane glycoproteins termed selections which are either expressed constitutively or induced in response to specific cytokines. Binding of selections to their counterpart ligand brings leukocytes into close, but not static, contact with vascular endothelial cells. The "tethered" leukocyte then begins a "rolling" process along the endothelium which continues until additional molecular interactions firmly stabilize a specific cell/cell interaction. One of the molecular binding activities which results in the stable interaction is effected by a second family of surface glycoproteins called integrins which possess a higher binding affinity for their respective ligands than selections.

The integrins are heterodimeric surface molecules comprised of an α and a β subunit in non-covalent association. All integrins are transmembrane proteins with counter-receptor binding activity localized in the extracellular domain. Integrins also possess relatively short cytoplasmic regions which participate in transmembrane signaling events. Integrins are capable of interacting with other cell-bound counter-receptors and components of the extracellular matrix, as well as soluble factors. Binding of extracellular ligands leads to crosslinking and localized clustering of integrins Miyamoto, et al., Science 267:833, 1995! and formation of focal adhesions wherein the clustered integrin cytoplasmic domains associate with cytoskeletal components including, for example, actin filaments Pavalko and Otey, Proc. Soc. Exp. Biol. Med. 205:32767, 1994, and Gumbiner, Neuron 11:551, 1993!. While most investigations into integrin physiological activity have focused on identifying specific counter-receptors using immunological methodologies as discussed infra, less is known about the specific interactions of integrins with cytoplasmic components. Mutation studies, however, have indicated that the cytoplasmic sequences are required for integrin association with focal contacts LaFlamme, et al., J. Cell. Biol. 117:437 (1992)!. Other data discussed infra support this observation.

While numerous integrins have been identified, certain subsets are unique to leukocytes, with each member of the subset having characteristic cell-specific expression and counter-receptor binding properties. Of leukocyte-specific integrins, at least three β₂ integrins are known, each comprised of a unique α subunit in association with a β₂ subunit (designated CD18) Kishimoto, et al., Cell 48:681-690 (1987)!. For a recent review of the state of the art with regard to β₂ integrins, see Springer, Cell 76:301-314 (1994). CD11a/CD18, also known as α_(L) β₂ or LFA-1, is expressed on all leukocytes and has been shown to bind to ICAM-1, ICAM-2, and ICAM-3. CD11b/CD18, also know as α_(M) β₂ or Mac-1, is expressed on polymorphonuclear neutrophils, monocytes and eosinophils and has been shown to bind to ICAM-1, complement factor iC3b, factor X, and fibrinogen. CD11c/CD18, also known as α_(X) β₂ or p150,95, is expressed on monocytes, polymorphonuclear neutrophils and eosinophils and has been shown to bind to complement factor iC3b and fibrinogen. In addition, a fourth human β₂ integrin, designated α_(d)β₂, has recently been identified Van der Vieren, et al., Immunity 3:683-690 (1995)!. Recently, it has been demonstrated that the actin-binding protein, filamin, directly binds to a cytoplasmic portion of β₂ subunits Sharma, et al., J. Immunol. 154:3461-3470 (1995)! which suggests a role one or more of for the β₂ integrins in formation of focal contacts and cell motility in general see review in Arnaout, Blood 75:1037 (1990)!.

A second subset of leukocyte specific integrins may be referred to as the α₄ integrins in view of the fact that both members of the family are comprised of a common α₄ subunit in association with either a β₁ or β₇ subunit. For a recent review, see Springer, supra. VLA-4, also referred to as α₄ β₁ or CD49d/CD29, is expressed on most peripheral blood leukocytes except neutrophils and specifically binds VCAM-1 and fibronectin. LPAM-1, also known as α₄ β₇, is expressed on all peripheral blood leukocytes and has been shown to bind MadCAM-1, fibronectin and VCAM-1. Expression of either of the α₄ integrins has also been demonstrated in a wide range of leukocyte cell types in lymphoid organs and in various tissues (Hemler et al., Immunol. Rev. 114:45-60, 1990; Kilshaw et al., Eur. J. Immunol. 20:2201-2207, 1990; Schweighoffer et al., J. Immunol. 151:717-729, 1993; and Lazarovits and Karsh, J. Immunol. 151:6482-6489, 1993). Consistent with the observed participation of β₂ integrins in formation of focal contacts, presumably through filamin binding, it has previously been shown that cytoplasmic portions of β₁ integrins directly bind a-actinin in vitro. While this interaction has not been demonstrated in vivo, it suggests physiological involvement of β₁ integrins in cell mobility and/or maintenance of cell morphology see review in Clark and Brugge, Science 268:233-238 (1995)!.

A number of in vitro and in vivo studies utilizing anti-α₄ monoclonal antibodies have indicated a role for the α₄ integrins in various pathophysiological conditions see review, Lobb and Hemler, J. Clin. Invest. 94:1722-1728 (1994)!. For example, several investigations have provided evidence that α₄ integrins are involved in leukocyte emigration from peripheral blood into regions of inflammation (Weg, et al., J. Exp. Med. 177:561-566, 1992; Winn and Harlan, J. Clin. Invest. 92:1168-1173, 1993). These observations suggest that anti-α₄ antibodies may be capable of ameliorating integrin-associated disease states, and this therapeutic potential has been demonstrated in several animal disease state models. For example, bolus injection of antibodies to α₄ integrins delayed the onset of paralysis in rat and murine experimental allergic encephalomyelitis (Yednock, et al., Nature 356:63-66, 1992; Baron, et al., J. Exp. Med. 177:57-68, 1993). Prophylactic administration of anti-α₄ antibodies reduced ear swelling in murine contact hypersensitivity models (Ferguson, et al., J. Immunol. 150:1172-1182, 1993; Nakajima, et al., J. Exp. Med. 179:1145-1154, 1994). Further, anti-α₄ antibodies were shown to reduce infiltration of pancreatic islets and delay the onset of diabetes in non-obese diabetic mice which are prone to spontaneous development of type I diabetes (Yang, et al., Proc. Natl. Acad. Sci. (USA) 90:10494-10498. 1993; Burkly, et al., Diabetes 43:529-534, 1994; Baron, et al., J. Clin. Invest. 93:1700-1708, 1994). Still other in vivo studies using anti-α₄ antibodies suggest a role for α₄ integrins in allergic lung inflammation (Pretolani, et al., J. Exp. Med. 180:795-805 (1994); Milne and Piper, Br. J. Pharnacol. 112:82Pa(Abstr), 1994); inflammatory bowel disease (Podolsky, et al., J. Clin. Invest. 92:372-380, 1993); cardiac allograft rejection (Paul, et al., Transplantation 55:1196-1199, 1993); acute nephrotoxic nephritis (Mulligan, et al., J. Clin. Invest. 91:577-587, 1993); and immune complex mediated lung injury (Mulligan, et al., J. Immunol. 159:2407-2417, 1993).

Thus there exists a need in the art to identify molecules which bind to and/or modulate the binding and/or signalling activities of the integrins and to develop methods by which these molecules can be identified. The methods, and the molecules thereby identified, will provide practical means for therapeutic intervention in oα₄ integrin-mediated immune and inflammatory responses.

BRIEF DESCRIPTION OF THE INVENTION

In one aspect, the present invention provides novel purified and isolated polynucleotides (e.g., DNA and RNA transcripts, both sense and antisense stands) encoding a filamin-like β₇ integrin binding protein designated FLP-1, or variants thereof (i.e., deletion, addition or substitution analogs) which possess binding and/or immunological properties inherent to FLP-1. Preferred DNA molecules of the invention include cDNA, genomic DNA and wholly or partially chemically synthesized DNA molecules. A presently preferred polynucleotide is the DNA as set forth in SEQ ID NO: 1, encoding the polypeptide according to SEQ ID NO:2.

Preferred polynucleotides of the invention comprise the cDNA set out in SEQ ID NO: 1 and DNAs which hybridize to the non-coding strands thereof under stringent conditions or which would hybridize but for the redundancy of the genetic code. Exemplary stringent hybridization conditions are as follows: hybridization at 42° C. in 5× SSPE and a final wash at 65° C. in 0.2× SSC. It is understood by those of skill in the art that variation in these conditions occurs based on the length and GC nucleotide content of the sequences to be hybridized. Formulas standard in the art are appropriate for determining exact hybridization conditions. See Sambrook, et al., Eds. 9.47-9.51 in Molecular Cloning, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989).

Also provided are recombinant plasmid and viral expression constructs which include FLP-1 encoding sequences, wherein the FLP-1 encoding sequence is operatively linked to a homologous or heterologous transcriptional regulatory element or elements.

As another aspect of the invention, prokaryotic or eukaryotic host cells, transformed or transfected with polynucleotide sequences of the invention, are provided which express FLP-1 polypeptides or variants thereof. Host cells of the invention are particularly useful for large scale production of FLP-1 polypeptides which can be isolated from the host cell itself or the medium in which the host cell is grown.

Also provided by the present invention are purified and isolated FLP-1 polypeptides, including fragments and variants thereof. Novel FLP-1 polypeptides of the invention may be isolated from natural sources, but along with FLP-1 variant polypeptides, are preferably produced by recombinant procedures involving host cells of the invention. Variant FLP-1 polypeptides, including fully glycosylated, partially glycosylated, and wholly de-glycosylated forms of the FLP-1 polypeptide may be generated, depending on the host cell selected for recombinant production and/or post-isolation processing. Additional variant FLP-1 polypeptides include water soluble and insoluble FLP-1 polypeptides and fragments thereof, analogs wherein one or more amino acids are deleted from, replaced in, or added to the preferred FLP-1 polypeptide, polypeptide analogs with equal or enhanced biological activities and/or immunological characteristics specific for FLP-1, and analogs with modified ligand binding and/or signal transducing capabilities. Fusion polypeptides are also provided wherein FLP-1 amino acid sequences are expressed contiguously with amino acid sequences derived from other polypeptides. Fusion polypeptides of the invention include those with modified biological, biochemical, and/or immunological properties in comparison to the preferred FLP-1 polypeptide.

Also contemplated by the present invention are antibodies and other peptide and non-peptide molecules which specifically bind to FLP-1. Binding molecules of this type are particularly useful for purifying FLP-1 polypeptides, identifying cell types which express FLP-1 polypeptides, and assaying for presence or absence of FLP-1 polypeptides in a fluid. Binding molecules are also useful for modulating (i.e., blocking, inhibiting, or stimulating) in vivo binding and/or signal transduction activities of FLP-1. Antibodies of the invention include monoclonal, polyclonal, and recombinant (i.e., humanized, chimeric, etc.) forms and fragments thereof.

Also contemplated by the invention are hybridomas which secrete monoclonal antibodies specifically immunoreactive with FLP-1. Likewise, cell types modified by recombinant means so as to express and/or secrete genetically engineered FLP-1 binding molecules are also comprehended.

Assays to identify FLP-1 binding molecules are also provided, including immobilized ligand binding assays, solution binding assays, scintillation proximity assays, two hybrid screening assays, immunological methodologies and the like. In addition to identifying FLP-1 binding molecules, the same or similar assays are useful for identification of molecules which modulate FLP-1 specific binding. For example, assays to identify modulators (i.e., activators or inhibitors) of FLP-1 specific binding can involve a) contacting FLP-1 or a fragment thereof, with β₇ integrin or a fragment thereof; b) measuring binding between FLP-1 or a fragment thereof, and β₇ integrin or a fragment thereof; c) measuring binding between FLP-1 or a fragment thereof, and β₇ integrin or a fragment thereof in the presence of a test compound, and d) comparing the measurement in step (b) and the measurement in step (c) wherein a decrease in binding in step (c) indicates the test compound in an inhibitor of binding, and an increase in binding in step (c) indicates the test compound is an activator of binding.

Variations on the method to identify modulators of FLP-1 binding can include scintillation proximity assays comprising the steps of immobilizing either FLP-1 or its binding partner on a solid support, wherein the solid support contains a fluorescent agent; modifying the non-immobilized binding partner to include a compound that can excite the immobilized fluorescent agent; contacting the non-immobilized binding partner with the immobilized binding partner; determining the level of light emission for the fluorescent agent; and repeating the procedure in the presence of a putative modulator of FLP-1 binding.

As still another variation of the method, a two hybrid system may be utilized to identify genes encoding potential modulators. In this system, an integrin is expressed in a host cell as a fusion protein with either a DNA binding domain or transactivation domain of a modular transcription factor. A binding partner protein is also expressed as a fusion protein with which ever transcription factor domain is not utilized with expression of the integrin fusion protein. Interaction of the two fusion proteins results in reconstitution of the holo-transcription factor and permits expression of a reporter gene with a promoter specific for the transcription factor. Use of this system in the presence or absence of library cDNA can permit identification of genes that encode proteins which modulate the degree of reporter gene expression.

Additional methods comprehended by the invention include immunological assays including radio-immuno assays, enzyme linked immunosorbent assays, sandwich assays and the like. Co-precipitation methods are also comprehended wherein an antibody immunospecific for one binding partner is utilized in a method in which the other binding partner is detectably labeled. Immunological assays may also include use of labeled antibodies specifically immunoreactive with a complex between the desired binding partners.

Numerous compounds are contemplated as being candidates for testing in methods of the invention. For example, antibody products which are immunoreactive with one binding partner and which modulate binding between the two molecules can be identified by the claimed method. Antibody products contemplated are monoclonal antibodies, and fragments thereof, humanized antibodies, and/or single chain antibodies. Other molecules which can be screened in the claimed method include peptides, small molecules and libraries composed of either of the same.

Modulators of β₇ /FLP-1 and β₇ /filamin interaction identified by the methods of the invention are utilized in vitro or in vivo to affect inflammatory processes involving leukocytes. In addition, modulating compounds which bind to either the β₇ integrin, filamin or FLP-1 are useful to monitor the level of its binding partner, either in a body fluid or biopsied tissue.

Those of ordinary skill in the art will readily appreciate that numerous variations of the claimed method are encompassed by the invention.

DETAILED DESCRIPTION OF THE INVENTION

The present invention is illustrated by the following examples relating to the isolation of a cDNA clone encoding FLP-1. Example 1 relates to identification of genes in a human B cell cDNA library that encode proteins which interact with β₇ integrin. Example 2 describes identification of genes in a human spleen cDNA library which encode proteins that interact with β₇ integrin. Example 3 addresses tissue specific expression of FLP-1. Example 4 describes specificity of interaction between filamin and β₇ and FLP-1 and β7 integrin. Example 5 describes localization of B₇ sequences required for filamin or FLP-1 binding. Example 6 relates to applications for modulators of β₇ /filamin or β₇ /FLP-1 interactions.

EXAMPLE 1

Identification of Genes in a B Cell Library Encoding β₇ Interacting Proteins

The two-hybrid system developed in yeast Durfee, et al., Genes and Development 7:555-567 (1993)! was used to screen for proteins expressed in a human B cell cDNA library which interact with the carboxy-terminal cytoplasmic tail of the β₇ integrin. The yeast two-hybrid screen is based on in vivo reconstitution of the GAL4 transcription factor and subsequent expression of a reporter gene driven by a GAL4 promoter. Briefly, GAL4 DNA-binding and transcription-activating domains are encoded on separate plasmids as portions of fusion proteins. Expression of the fusion proteins and interaction of the expression products results in association of the two GAL4 domains and ultimate expression the β-galactosidase reporter gene under transcriptional control of the GAL4 promoter.

In the present investigation, a "bait" plasmid (pAS1) was constructed that contained sequences encoding the GAL4-binding domain, a trp⁻ selection requirement, a hemagglutinin (HA) epitope tag and cytoplasmic amino acid sequences of β₇ integrin. The β₇ integrin cytoplasmic domain was amplified by PCR using β₇ primers set out in SEQ ID NO:3 and 4.

    NHβ.sub.7 5 CGGATCCTCGGATACCGGCTCTCGGTGAAG (SEQ ID NO: 3)     NHβ.sub.7 3 CGGCTCCTCAGAGAGTGGGACTGTCTGCCT (SEQ ID NO: 4)

Reaction conditions included an initial incubation at 94° C. for four minutes, followed by thirty cycles of: 94° C. for one minute, 50° C. for two minutes, and 72° C. for four minutes. The resulting product was sequenced to rule out PCR-derived errors and subcloned into vector pAS1. A yeast strain, Y190, was transformed with β₇ /pAS1 by standard methods and grown in selective media (trp⁻) to mid-log phase. Cells were lysed in lysis buffer (containing 100 mM Tris, pH 6.8, 2% SDS, 10% glycerol, 5% BME and 0.1% bromo phenol blue) and the equivalent of 5-6×10⁶ cells of protein was separated on a 12% polyacrylamide gel. Proteins in the gel were transferred to a PVDF (Millipore, Bedford, Mass.) membrane by standard methods. Control lanes on the gel contained lysate from Y190 cells transformed with pASI vector alone (containing no β₇ integrin-encoding sequences). Western blotting was performed using antibody 12CA5, immunospecific for the HA epitope tag, (Boehringer Mannheim, Indianapolis, Ind.) and a goat anti-mouse IgG horse radish peroxidase (HRP) secondary antibody. Results, in combination with size determination using SDS-PAGE, confirmed that the fusion protein β₇ integrin cytoplasmic tail/HA/GAL4 DNA-binding domain was expressed at readily detectable levels.

A "target" vector was constructed with vector pACT modified to contain sequences encoding the GAL4 activation domain II fused to a B cell cDNA library and a leu⁻ selection requirement. Lymphocyte cDNA library sequences were inserted at an XhoI site of the vector. β₇ /pAS1-transformed Y190 cells were transformed by standard methods with the pACT-lymphocyte library DNA and cells grown under selective conditions (leu⁻ /trp⁻ /his⁻ /3-aminotriazole). Resulting colonies were tested for β-galactosidase (β-gal) activity by the blue/white selection method well known in the art and forty-four β-gal positive clones were obtained. Sequence analysis of the B cell cDNA-derived pACT inserts in each of the clones revealed twenty novel sequences and twenty four sequences encoding known proteins or portions of known proteins.

Five clones were of particular interest, all of which contained sequences encoding a portion of the non-muscle protein filamin, or actin-binding protein ABP280(emb/X53416), Gorlin, et al., J. Cell Biol. 111: 1089-1105 (1990)!. All five clones were shown to encode the carboxy-terminal portions of filamin (SEQ ID NO: 7) and each clone extended into 3' untranslated portions of the filamin gene. Clone 411 corresponded to sequences in repeat 20 (beginning at nucleotide 6763 in SEQ ID NO: 7) and clones 514, 1521, 1271 and 722 beginning in repeat 23 (each beginning at nucleotide 7513, 7552, 7579, and 7579 in SEQ ID NO: 7, respectively). There was one discrepancy between the published sequence of filamin and the sequences determined in each of the positive clones: all positive clones had an aspartate residue at position 2634, while the published sequence of filamin had a histidine at that position. Of these clones, 1271 was selected for subsequent analysis, and the nucleotide and amino acids sequences of 1271 are set out in SEQ ID NOs: 5 and 6, respectively.

EXAMPLE 2

Identification of Genes in a Human Spleen Library Encoding β₇ Interacting Proteins

The two-hybrid system described in Example 1 was repeated using human spleen cDNA library sequences (Clontech, Palo Alto, Calif.) cloned into an EcoRI site of the target vector pGAD10 (Clontech).

After transformation of the β₇ /pAS1 Y190 strain with the spleen/pGAD10 plasmid and selection as previously described, the resulting colonies were tested for β-gal activity and six positive clones were identified. Sequence analysis of the six β-gal positive clones that revealed five identical clones (from which clone S5 was selected for further analysis) along with clone S3, (the sixth positive clone and distinct from the other five) were identified.

DNA and protein alignments revealed that clones S3 and S5 encode different, but overlapping regions of the same protein, with the S3 insert beginning 5' of the S5 insert, and terminating before the 3' end of clone S5. The DNA sequences of clones S3 and S5 were compared to DNA databases using NCBI Blastn with default parameters on Oct. 16, 1995, and both clones were found to exhibit approximately 70% identity to filamin. The nucleotide and amino acid sequences of clone S3 are set out in SEQ ID NOs: 9 and 10, respectively. Sequences for clone S5 are set out in SEQ ID NOs: 11 and 12, respectively. The composite protein encoded by the overlapping clones S3 and S5 was designated FLP-1 (filamin like protein). Blastp search of protein database (NCBI Blastp) revealed that the composite protein FLP-1 has a 73% identity to filamin. Alignment of FLP-I to filamin shows that clones S3 and S5 represent carboxy terminal regions of FLP-1. When FLP-1 is aligned with filamin in the second hinge region between repeats 23 and 24, the putative glycoprotein binding region, the degree of identity drops to 38%, suggesting a difference in binding affinity between filamin and FLP-1 for membrane glycoproteins.

In addition, a region of clone S5 was further found to exhibit 100% identity to truncated actin-binding protein TABP (GP or GE/M62994), a protein previously shown to be is a truncated, non-actin-binding filamin-like protein Leedman, et al., Proc. Natl.Acad. Sci. (USA) 90:5994-5998 (1993)! having 195 amino acids and a molecular weight of approximately 21 kDa. Identity was particularly high between nucleotides 950-1515 of clone 5 which were 95-99% identical to regions of TABP. TABP lacks an actin binding domain and 22 of 24 tandem repeats found in filamin, but contains sequences homologous to the carboxy terminal repeats numbered 23 and 24 found in filamin. The TABP hinge region, between repeats 23 and 24, contains a putative glycoprotein binding site and a Ca²⁺ /calmodulin kinase II phosphorylation site Leedman, supra!. TABP is encoded by a 2.3 kb mRNA and a CDNA encoding TABP was cloned from a thyroid expression library from a Graves disease patient Leedmen, supra!.

In order to obtain a more complete FLP-1 sequence, the human spleen cDNA library was screened using S3 as a probe. The S3 clone was digested with EcoRI and a 1.2 kb fragment was isolated and labeled using the Random Primed Labeling Kit (Boehringer Mannheim, Indianapolis, Ind.) according to the manufacturer's suggested protocol. Unincorporated nucleotides were removed using a Centrisep column (Princeton Separations, Adelphia, N.J.). The probe was added to filters in hybridization solution (5× SSPE, 45% formamide, 5× Denhardts, 1% SDS) and hybridized overnight at 42° C. The filters were washed at a final stringency of 0.2× SSC/0.1% SDS at 65° C.

Primary positive clones were picked, diluted and replated on Hybond N⁺ filters on LBM plates. Two duplicate filters were rehybridized with hybridization solution saved from the original hybridization described supra. Clones which were positive on both filters were picked, grown and their plasmids isolated and sequenced by standard methods.

Ten FLP-1 positive clones were detected and sequence data from these clones was compared to filamin and FLP-1 sequences derived from clone S3 and S5. Overlap of sequences from clones S3 and S5 with sequences from clones F3, F5 and F7 permitted determination of a more complete sequence for FLP-1, the more complete nucleotide and amino acid sequences set out in SEQ ID NOs: 1 and 2, respectively. In SEQ ID NO: 1, nucleotides 1-315 were derived from clone F5; nucleotides 316-738 from clone F3; nucleotides 739-816 from clone F7; nucleotides 817-1122 from clone S3 and nucleotides 1123-2574 from clone S5.

EXAMPLE 3

Tissue Specific Expression of FLP-1

In order to determine size of a mRNA encoding FLP-1 in various tissues, a human immune system multiple tissue northern (Clontech) was probed with a random-primed portion of clone S3 (corresponding to nucleotides 255-777 in SEQ ID NO: 9) according to manufacturer's suggested protocol.

In spleen, lymph node, thymus, bone marrow, and fetal liver, mRNA of two distinct sizes hybridized to the FLP-1 probe: one just above and one just below the 9.5Kb size marker. In appendix and peripheral blood leukocytes, only one band, just below the 9.5Kb size marker, hybridized with the FLP-1 probe. These results suggest that the FLP-1 mRNA encodes a protein similar in size to filamin as reported in Gorlin, supra.

EXAMPLE 4

Specificity of Filamin/β₇ and FLP-1/β₇ Interaction

The specificity of the interactions of filamin and FLP-1 with the β₇ integrin cytoplasmic tail was verified by transforming filamin clone 1271 and FLP-1 clone S5 into Y190 strains containing any one of a variety of "baits" vectors (encoding either β₁, β₂, β₇ α_(L), or α₄ integrin cytoplasmic tails) using standard methods described supra. Results from this assay, shown in Table 1, indicated that filamin and FLP-1 specifically bind to β₇ integrins and not to other integrins.

                  TABLE 1     ______________________________________     Binding Specificity of Filamin and FLP-1     SPECIFICITY OF INTERACTION     INTEGRIN "BAIT"   FILAMIN  FLP-1     ______________________________________     β.sub.1      -        -     β.sub.2      -        -     β.sub.7      +        +     α.sub.L     -        -     α.sub.4     -        -     ______________________________________

In vivo interaction between filamin and β₇ integrin was also investigated by co-precipitation of a filamin/α₄ β₇ complex from JY cells, which express endogenous α₄ β₇ -Cells were initially permeabilized with 10 μg/ml lysolecithin (Sigma, St. Louis, Mo.) in PBS, pH 8.0, with 1 mM Ca⁺⁺ and 1 mM Mg⁺⁺, for five minutes. Cellular proteins were crosslinked using DTSSP (921 μM) and labeled with biotin as described in Altin, et al., Anal. Biochem. 224:382-389 (1995). Crosslinked cells were lysed using 1% Triton-X100 and integrins were immunoprecipitated using monoclonal antibodies immunospecific for α₄ (antibody HP2/1, Immunotech, Westbrook, Me., or antibody B5G10, Upstate Biotechnology, Inc., Lake Placid, N.Y.), or β₂ (antibody 23 IIIb). A control antibody, PC21 (Sigma, St. Louis, Mo.) was also employed. Precipitated proteins were separated on a 6% SDS-PAGE gel, transferred to an Immobilon P membrane and probed with filamin antisera (Chemicon International, Inc. Temecula, Calif.).

Consistent with the hybrid screen, results demonstrate filamin co-precipitation with an α₄ integrin, however in this assay, full length filamin also co-precipitated with the β₂ subunit.

EXAMPLE 5

Localization of FLP-1 or Filamin Binding on β₇

In order to more fully characterize the binding between FLP-1 or filamin and β₇ integrin, the two-hybrid assay was employed using various deletion derivatives of either of the individual binding partners.

Several cytoplasmic domain mutants of the β₇ integrin were created using PCR in order to map the site(s) of interaction observed as described above. Filamin clone 1271 and FLP-1 clone S5 were employed to evaluate the degree to which mutations in the β₇ cytoplasmic domain affected binding. Following standard co-transformations of Y190 as described above, binding interactions were determined by β-gal assay, as described above. The β₇ deletions and substitutions are set out in SEQ ID NOS: 14 to 18 below, and compared to the native β₇ sequence set out in SEQ ID NO: 13. In each expression construct, only the cytoplasmic portion of β₇, or a truncation thereof, was subcloned.

    __________________________________________________________________________     β.sub.7         RLSVEIYDRREYSRFEKEQQQLNWKQDSNPLYKSAITTTINPRFQEADSPTL                                         (SEQ ID NO: 13)     β.sub.7 D1        RLSVEIYDRREYSRFEKEQQQLNWKQDSNP   (SEQ ID NO: 14)     β.sub.7 D2        RLSVEIYDRREYSRFEKEQQQLNWKQDSNPLYKSA                                         (SEQ ID NO: 15)     β.sub.7 D3        RLSVEIYDRREYSRFEKEQQQLNWKQDSNPLYKSAITTTINP                                         (SEQ ID NO: 16)     β.sub.7 D4        RLSVEIYDRREYSRFEKE               (SEQ ID NO: 17)     β.sub.7 D5        RLSVEIYDRREYSR                   (SEQ ID NO: 18)     __________________________________________________________________________

Primers used to generate the various deletion mutants are set out in SEQ ID NOs: 19 to 23, below, and were individually utilized in an amplification reaction pairs with the primer set out in SEQ ID NO: 3. Reaction conditions were as described in Example 1.

    __________________________________________________________________________     NHβ.sub.7 D1            GATGGCACTTTTGTACTAAGGATTACTGTCCTG                                 (SEQ ID NO: 19)     NHβ.sub.7 D2            ATTGATGGTGGTCGTCTAGGCACTTTTGTAGAG                                 (SEQ ID NO: 20)     NHβ.sub.7 D3            GTCTGCCTCTTGAAACTAAGGATTGATGGTGGT                                 (SEQ ID NO: 21)     NHβ.sub.7 D4            CCAGTTGAGTTGTTGCTACTCCTTCTCAAAGCG                                 (SEQ ID NO: 22)     NHβ.sub.7 D5            GTTGCTGCTCCTTCTCCTAGCGACTGTATTCCCG                                 (SEQ ID NO: 23)     __________________________________________________________________________

In addition, a series of β₇ substitution mutants were also constructed wherein the sequence changes are set out in SEQ ID NOs: 24 to 27, with the substituted amino acid residue underlined.

    __________________________________________________________________________     β.sub.7 S3A         RLAVEIYDRREYSRFEKEQQQLNWKQDSNPLYKSAITTTINPRFQEADSPTL                                         (SEQ ID NO: 24)     β.sub.7 E5Q         RLSVQIYDRREYSRFEKEQQQLNWKQDSNPLYKSAITTTINPRFQEADSPTL                                         (SEQ ID NO: 25)     β.sub.7 R9A         RLSVEIYDAREYSRFEKEQQQLNWKQDSNPLYKSAITTTINPRFQEADSPTL                                         (SEQ ID NO: 26)     β.sub.7 S13A         RLSVEIYDRREYARFEKEQQQLNWKQDSNPLYKSAITTTINPRFQEADSPTL                                         (SEQ ID NO: 27)     __________________________________________________________________________

Oligonuleotides used to generate the various substitution variants are set out in SEQ ID NOs: 28 to 31, infra.

    __________________________________________________________________________     B7S3A  GTCATAGATTTCCACCGCGAGCCGGTATCCGAG                                  (SEQ ID NO: 28)     B7E5Q  CCGGCCGTCATAGATTTGCACCGAGAGCCGGTATC                                  (SEQ ID NO: 29)     B7R9A  GCGACTGTATTCCCGCGCGTCATAGATTTCCAC                                  (SEQ ID NO: 30)     B7S13A CTCCTTCTCAAAGCGCGCGTATTCCCGGCGGTC                                  (SEQ ID NO: 31)     __________________________________________________________________________

Specific truncation mutants of filamin were generated by PCR amplification of existing clones under conditions described in Example 1. Mutant ABPD1 encoded a region including an actin binding domain and a dimerization domain of filamin (amino acid 2487-2644 in SEQ ID NO: 7). Mutant ABPD2 encoded a truncated form of ABPD1 which lacked the filamin dimerization domain (amino acids 2487-2577 in SEQ ID NO: 7). Both mutants were generated by PCR using primers set out in SEQ ID NO: 32 and 33 to generate mutant ABPD1, and primers set out in SEQ ID NO: 32 and 34 to generate ABPD2.

    __________________________________________________________________________     ABP.5x ATATCTCGAGAGTATACCCCCATGGCACCT                                (SEQ ID NO: 32)     ABP.Xho1            ATATCTCGAGTCAGGGCACCACAACGCG                                (SEQ ID NO: 33)     ABP.Xho2            ATATCTCGAGTCAGCTGCTCTTCTGGCCCTAC                                (SEQ ID NO: 34)     __________________________________________________________________________

Primers as described were used in a reaction with filamin clone 1271 under the following amplification conditions: an initial incubation at 94° C. for five minutes, followed by thirty cycles of 94° C. for 30 seconds, 50° C. for 30 seconds, and 72° C. for one minute. The resulting PCR product was cut with Xho1 and ligated into vector pACT (described in Example 1) previously digested with Xho1.

An FLP-1 mutant comprised of amino acid sequences 696 to 857 in SEQ ID NO: 1 and showing identity to TABP (the TABP-like analog) was also generated by PCR amplification (under conditions described in Example 1) from a human spleen cDNA library. The FLP-1 mutant was generated by PCR using the primer pair set out in SEQ ID NO: 35 and 36.

    __________________________________________________________________________     TABP.Nde           ATATCATATGTACACCCCCATGGCTCCT                               (SEQ ID NO: 35)     TABP.Bam           ATAGGATCCTCAGCCCCACAAACAGGC                               (SEQ ID NO: 36)     __________________________________________________________________________

Reactions were carried out using 2.5 μg spleen cDNA under the following amplification conditions: an initial incubation at 94° C. for five minutes, followed by thirty cycles of 94° C. for 30 seconds, 50° C. for 30 seconds, and 72° C. for one minute. The resulting PCR products were digested with NdeI and BamHi and cloned into vector pET previously digested with the same enzymes. The resulting TABP/pET vector was then utilized in a secondary PCR with the PCR primer pair set out in SEQ ID NO: 32 and 33, above, under the following conditions: an initial incubation at 94° C. for five minutes, followed by thirty cycles at 94° for one minute, 50° C. for one minute and 72° C. for two minutes. The resulting PCR product was digested with Xho1 and cloned into pACT previously digested with Xho1. The FLP-1 TABP-like truncate represents the same size and region in filamin as represented by mutant ABPD1.

Results from the two hybrid assay as shown in Table 2 indicated that none of the β₇ cytoplasmic domain mutations completely disrupted interaction with either filamin clone 1271 or FLP-1 clone S5. The substitution mutant β₇ E5Q however appeared to result in a weaker interaction with filamin clone 1271 and ABDP1. Removal of the 3' untranslated region in filamin mutant ABPD1 also did not disrupt the interaction with the β₇ cytoplasmic domain, but removal of the dimerization domain as in filamin mutant ABPD2 abolished filamin/β₇ interaction completely. These data indicate that the filamin dimerization region is critical for interaction with the β₇ cytoplasmic domain.

The FLP-1 TABP-like truncate failed to interact with the β₇ integrin cytoplasmic domain. In addition, none of the β₇ mutants disrupted detectable FLP-1/β₇ interaction.

                  TABLE 2     ______________________________________     INTERACTION OF FILAMIN AND FLP-1     WITH β.sub.7 DELETION AND SUBSTITUTION ANALOGS     FILAMIN      ABPD1   ABPD2     FLP-1 TABP     ______________________________________     β.sub.1           -          -               -     -     β.sub.2           -          -       -       -     -     β.sub.7           +          +/-     -       +     -     β.sub.7 D1           +          +       +/-     -     β.sub.7 D2           +          +       +/-     -     β.sub.7 D3           +          +       -       +     β.sub.7 D4           +          +       -       +     β.sub.7 D5           +          +               +     β.sub.7 S3A           +          +               +     β.sub.7 E5Q           +/-        +/-             +     β.sub.7 R9A           +          +               +     β.sub.7 S13A           +          +               +     α.sub.L           -                          -     -     α.sub.4           -                          -     -     ______________________________________

EXAMPLE 6

Applications for Modulators of Filamin/fi₇ and FLP-1/β₇ Binding

Two β₇ associated integrins have been identified: α₄ β₇ and α_(E) β₇. Both are expressed on a subpopulation of peripheral blood lymphocytes and their expression is inducible. Both are expressed on macrophages but not monocytes and both appear to function in homing or localization of lymphocytes to mucosal tissue see review in Jutila, J. Leukocyte Biol. 55:133-140 (1994)!. The homing properties of α₄ β₇ can be attributed to interaction with MadCAM-1 expressed in mucosal nodes, while the retention of α_(E) β₇ ⁺ cells in the gut is attributed to interactions with epithelial cells expressing E-cadherin. Thus, binding by one or both β₇ integrins to their respective counter-receptor may contribute to mucosal immune responses as well as inflammatory (e.g., inflammatory bowel disease, IBD) and autoimmune responses at this site.

Further, it has been suggested that filamin is important in cell locomotion due to the fact that cells expressing low levels of the protein do not form leading lamella structures required for locomotion. The structural homology of FLP-1 to filamin suggests a similar role for this protein. In view of the observation that integrins can be observed clustered in point contacts, which are also important in cell locomotion, the invention contemplates that β₇ interaction with FLP-1 and/or filamin may be crucial to cell movement, and that disruption of the interactions will be useful, for example, in preventing the homing of ₇ ⁺ cells which occurs in certain pathological inflammatory responses such as IBD.

In order to identify modulators of β₇ /FLP-1 interaction, it is necessary to clearly define the portions of both proteins which are necessary for binding. Amino acid substitution, through standard mutagenesis techniques will permit identification of the binding regions of the proteins. Deletion analysis, wherein truncated forms of either protein are generated, for example by PCR, is also useful for identification of binding regions if the deletion does not disrupt the tertiary or quaternary structure of the protein to the point that it is no longer recognized buy its counter-receptor.

Identification of the significant protein regions involved in binding permits more accurate and efficient screening of putative modulators of binding activity. The invention contemplates of a high throughput screening assay to analyze large libraries of small molecules or peptides, as well as antibodies immunospecific for either or both binding partners, for the ability to modulate binding of β₇ integrins to FLP-1 or filamin. While two hybrid screening, scintillation proximity assays (SPA) and immunological methodologies, for example, enzyme-linked immunosorbent assays (ELISA), disclosed herein are not HTS methods per se, they are amenable to test many of the compounds listed for an ability to modulate binding. SPA and ELISA are particularly useful in this identification process and can be modified to permit high throughput screening of the test compounds described.

    __________________________________________________________________________     #             SEQUENCE LISTING     - (1) GENERAL INFORMATION:     -    (iii) NUMBER OF SEQUENCES: 36     - (2) INFORMATION FOR SEQ ID NO:1:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 2574 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 1..2574     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:     - CCT TTT GAC CTG GTC ATT CCG TTT GCT GTC AG - #G AAA GGA GAA ATC ACT       48     Pro Phe Asp Leu Val Ile Pro Phe Ala Val Ar - #g Lys Gly Glu Ile Thr     #                 15     - GGA GAG GTC CAC ATG CCT TCT GGG AAG ACA GC - #C ACA CCT GAG ATT GTG       96     Gly Glu Val His Met Pro Ser Gly Lys Thr Al - #a Thr Pro Glu Ile Val     #             30     - GAC AAC AAG GAC GGC ACG GTC ACT GTT AGA TA - #T GCC CCC ACT GAG GTC      144     Asp Asn Lys Asp Gly Thr Val Thr Val Arg Ty - #r Ala Pro Thr Glu Val     #         45     - GGG CTC CAT GAG ATG CAC ATC AAA TAC ATG GG - #C AGC CAC ATC CCT GAG      192     Gly Leu His Glu Met His Ile Lys Tyr Met Gl - #y Ser His Ile Pro Glu     #     60     - AGC CCA CTC CAG TTC TAC GTG AAC TAC CCC AA - #C AGT GGA AGT GTT TCT      240     Ser Pro Leu Gln Phe Tyr Val Asn Tyr Pro As - #n Ser Gly Ser Val Ser     # 80     - GCA TAC GGT CCA GGC CTC GTG TAT GGA GTG GC - #C AAC AAA ACT GCC ACC      288     Ala Tyr Gly Pro Gly Leu Val Tyr Gly Val Al - #a Asn Lys Thr Ala Thr     #                 95     - TTC ACC ATC GTC ACA GAG GAT GCA GGA GAA GG - #T GGT CTG GAC TTG GCT      336     Phe Thr Ile Val Thr Glu Asp Ala Gly Glu Gl - #y Gly Leu Asp Leu Ala     #           110     - ATT GAG GGC CCC TCA AAA GCA GAA ATC AGC TG - #C ATT GAC AAT AAA GAT      384     Ile Glu Gly Pro Ser Lys Ala Glu Ile Ser Cy - #s Ile Asp Asn Lys Asp     #       125     - GGG ACA TGC ACA GTG ACC TAC CTG CCG ACT CT - #G CCA GGC GAC TAC AGC      432     Gly Thr Cys Thr Val Thr Tyr Leu Pro Thr Le - #u Pro Gly Asp Tyr Ser     #   140     - ATT CTG GTC AAG TAC AAT GAC AAG CAC ATC CC - #T GGC AGC CCC TTC ACA      480     Ile Leu Val Lys Tyr Asn Asp Lys His Ile Pr - #o Gly Ser Pro Phe Thr     145                 1 - #50                 1 - #55                 1 -     #60     - GCC AAG ATC ACA GAT GAC AGC AGG CGG TGC TC - #C CAG GTG AAG TTG GGC      528     Ala Lys Ile Thr Asp Asp Ser Arg Arg Cys Se - #r Gln Val Lys Leu Gly     #               175     - TCA GCC GCT GAC TTC CTG CTC GAC ATC AGT GA - #G ACT GAC CTC AGC AGC      576     Ser Ala Ala Asp Phe Leu Leu Asp Ile Ser Gl - #u Thr Asp Leu Ser Ser     #           190     - CTG ACG GCC AGC ATT AAG GCC CCA TCT GGC CG - #A GAC GAG CCC TGT CTC      624     Leu Thr Ala Ser Ile Lys Ala Pro Ser Gly Ar - #g Asp Glu Pro Cys Leu     #       205     - CTG AAG AGG CTG CCC AAC AAC CAC ATT GGC AT - #C TCC TTC ATC CCC CGG      672     Leu Lys Arg Leu Pro Asn Asn His Ile Gly Il - #e Ser Phe Ile Pro Arg     #   220     - GAA GTG GGC GAA CAT CTG GTC AGC ATC AAG AA - #A AAT GGC AAC CAT GTG      720     Glu Val Gly Glu His Leu Val Ser Ile Lys Ly - #s Asn Gly Asn His Val     225                 2 - #30                 2 - #35                 2 -     #40     - GCC AAC AGC CCC GTG TCT ATC ATG GTG GTC CA - #G TCG GAG ATT GGT GAC      768     Ala Asn Ser Pro Val Ser Ile Met Val Val Gl - #n Ser Glu Ile Gly Asp     #               255     - GCC CGC CGA GCC AAA GTC TAT GGC CGC GGC CT - #G TCA GAA GGC CGG ACT      816     Ala Arg Arg Ala Lys Val Tyr Gly Arg Gly Le - #u Ser Glu Gly Arg Thr     #           270     - TTC GAG ATG TCT GAC TTC ATC GTG GAC ACA AG - #G GAT GCA GGT TAT GGT      864     Phe Glu Met Ser Asp Phe Ile Val Asp Thr Ar - #g Asp Ala Gly Tyr Gly     #       285     - GGC ATA TCC TTG GCG GTG GAA GGC CCC AGC AA - #A GTG GAC ATC CAG ACG      912     Gly Ile Ser Leu Ala Val Glu Gly Pro Ser Ly - #s Val Asp Ile Gln Thr     #   300     - GAG GAC CTG GAA GAT GGC ACC TGC AAA GTC TC - #C TAC TTC CCT ACC GTG      960     Glu Asp Leu Glu Asp Gly Thr Cys Lys Val Se - #r Tyr Phe Pro Thr Val     305                 3 - #10                 3 - #15                 3 -     #20     - CCT GGG GTT TAT ATC GTC TCC ACC AAA TTC GC - #T GAC GAG CAC GTG CCT     1008     Pro Gly Val Tyr Ile Val Ser Thr Lys Phe Al - #a Asp Glu His Val Pro     #               335     - GGG AGC CCA TTT ACC GTG AAG ATC AGT GGG GA - #G GGA AGA GTC AAA GAG     1056     Gly Ser Pro Phe Thr Val Lys Ile Ser Gly Gl - #u Gly Arg Val Lys Glu     #           350     - AGC ATC ACC CGC ACC AGT CGG GCC CCG TCC GT - #G GCC ACT GTC GGG AGC     1104     Ser Ile Thr Arg Thr Ser Arg Ala Pro Ser Va - #l Ala Thr Val Gly Ser     #       365     - ATT TGT GAC CTG AAC CTG AAA ATC CCA GAA AT - #C AAC AGC AGT GAT ATG     1152     Ile Cys Asp Leu Asn Leu Lys Ile Pro Glu Il - #e Asn Ser Ser Asp Met     #   380     - TCG GCC CAC GTC ACC AGC CCC TCT GGC CGT GT - #G ACT GAG GCA GAG ATT     1200     Ser Ala His Val Thr Ser Pro Ser Gly Arg Va - #l Thr Glu Ala Glu Ile     385                 3 - #90                 3 - #95                 4 -     #00     - GTG CCC ATG GGG AAG AAC TCA CAC TGC GTC CG - #G TTT GTG CCC CAG GAG     1248     Val Pro Met Gly Lys Asn Ser His Cys Val Ar - #g Phe Val Pro Gln Glu     #               415     - ATG GGC GTG CAC ACG GTC AGC GTC AAG TAC CG - #T GGG CAG CAC GTC ACC     1296     Met Gly Val His Thr Val Ser Val Lys Tyr Ar - #g Gly Gln His Val Thr     #           430     - GGC AGC CCC TTC CAG TTC ACC GTG GGG GCA CT - #T GGT GAA GGA GGC GCC     1344     Gly Ser Pro Phe Gln Phe Thr Val Gly Ala Le - #u Gly Glu Gly Gly Ala     #       445     - CAC AAG GTG CGG GCA GGA GGC CCT GGC CTG GA - #G AGA GGA GAA GCG GGA     1392     His Lys Val Arg Ala Gly Gly Pro Gly Leu Gl - #u Arg Gly Glu Ala Gly     #   460     - GTC CCA GCT GAG TTC AGC ATT TGG ACC CGG GA - #A GCA GGC GCT GGA GGC     1440     Val Pro Ala Glu Phe Ser Ile Trp Thr Arg Gl - #u Ala Gly Ala Gly Gly     465                 4 - #70                 4 - #75                 4 -     #80     - CTC TCC ATC GCT GTT GAG GGC CCC AGT AAG GC - #C GAG ATT ACA TTC GAT     1488     Leu Ser Ile Ala Val Glu Gly Pro Ser Lys Al - #a Glu Ile Thr Phe Asp     #               495     - GAC CAT AAA AAT GGG TCG TGC GGT GTA TCT TA - #T ATT GCC CAA GAG CCT     1536     Asp His Lys Asn Gly Ser Cys Gly Val Ser Ty - #r Ile Ala Gln Glu Pro     #           510     - GGT AAC TAC GAG GTG TCC ATC AAG TTC AAT GA - #T GAG CAC ATC CCG GAA     1584     Gly Asn Tyr Glu Val Ser Ile Lys Phe Asn As - #p Glu His Ile Pro Glu     #       525     - AGC CCC TAC CTG GTG CCG GTC ATC GCA CCC TC - #C GAC GAC GCC CGC CGC     1632     Ser Pro Tyr Leu Val Pro Val Ile Ala Pro Se - #r Asp Asp Ala Arg Arg     #   540     - CTC ACT GTT ATG AGC CTT CAG GAA TCG GGA TT - #A AAA GTT AAC CAG CCA     1680     Leu Thr Val Met Ser Leu Gln Glu Ser Gly Le - #u Lys Val Asn Gln Pro     545                 5 - #50                 5 - #55                 5 -     #60     - GCA TCC TTT GCT ATA AGG TTG AAT GGC GCA AA - #A GGC AAG ATT GAT GCA     1728     Ala Ser Phe Ala Ile Arg Leu Asn Gly Ala Ly - #s Gly Lys Ile Asp Ala     #               575     - AAG GTG CAC AGC CCC TCT GGA GCC GTG GAG GA - #G TGC CAC GTG TCT GAG     1776     Lys Val His Ser Pro Ser Gly Ala Val Glu Gl - #u Cys His Val Ser Glu     #           590     - CTG GAG CCA GAT AAG TAT GCT GTT CGC TTC AT - #C CCT CAT GAG AAT GGT     1824     Leu Glu Pro Asp Lys Tyr Ala Val Arg Phe Il - #e Pro His Glu Asn Gly     #       605     - GTC CAC ACC ATC GAT GTC AAG TTC AAT GGG AG - #C CAC GTG GTT GGA AGC     1872     Val His Thr Ile Asp Val Lys Phe Asn Gly Se - #r His Val Val Gly Ser     #   620     - CCC TTC AAA GTG CGC GTT GGG GAG CCT GGA CA - #A GCG GGG AAC CCT GCC     1920     Pro Phe Lys Val Arg Val Gly Glu Pro Gly Gl - #n Ala Gly Asn Pro Ala     625                 6 - #30                 6 - #35                 6 -     #40     - CTG GTG TCC GCC TAT GGC ACG GGA CTC GAA GG - #G GGN ACC ACA GGT ATC     1968     Leu Val Ser Ala Tyr Gly Thr Gly Leu Glu Gl - #y Xaa Thr Thr Gly Ile     #               655     - CAG TCG GAA TTC TTT ATT AAC ACC ACC CGA GC - #A GGT CCA GGG ACA TTA     2016     Gln Ser Glu Phe Phe Ile Asn Thr Thr Arg Al - #a Gly Pro Gly Thr Leu     #           670     - TCC GTC ACC ATC GAA GGC CCA TCC AAG GTT AA - #A ATG GAT TGC CAG GAA     2064     Ser Val Thr Ile Glu Gly Pro Ser Lys Val Ly - #s Met Asp Cys Gln Glu     #       685     - ACA CCT GAA GGG TAC AAA GTC ATG TAC ACC CC - #C ATG GCT CCT GGT AAC     2112     Thr Pro Glu Gly Tyr Lys Val Met Tyr Thr Pr - #o Met Ala Pro Gly Asn     #   700     - TAC CTG ATC AGT GTC AAA TAC GGT GGG CCC AA - #C CAC ATC GTG GGC AGT     2160     Tyr Leu Ile Ser Val Lys Tyr Gly Gly Pro As - #n His Ile Val Gly Ser     705                 7 - #10                 7 - #15                 7 -     #20     - CCC TTC AAG GCC AAG GTG ACT GGC CAG CGT CT - #A GTT AGC CCT GGC TCA     2208     Pro Phe Lys Ala Lys Val Thr Gly Gln Arg Le - #u Val Ser Pro Gly Ser     #               735     - GCC AAC GAG ACC TCA TCC ATC CTG GTG GAG TC - #A GTG ACC AGG TCG TCT     2256     Ala Asn Glu Thr Ser Ser Ile Leu Val Glu Se - #r Val Thr Arg Ser Ser     #           750     - ACA GAG ACC TGC TAT AGC GCC ATT CCC AAG GC - #A TCC TCG GAC GCC AGC     2304     Thr Glu Thr Cys Tyr Ser Ala Ile Pro Lys Al - #a Ser Ser Asp Ala Ser     #       765     - AAG GTG ACC TCT AAG GGG GCA GGG CTC TCA AA - #G GCC TTT GTG GGC CAG     2352     Lys Val Thr Ser Lys Gly Ala Gly Leu Ser Ly - #s Ala Phe Val Gly Gln     #   780     - AAG AGT TCC TTC CTG GTG GAC TGC AGC AAA GC - #T GGC TCC AAC ATG CTG     2400     Lys Ser Ser Phe Leu Val Asp Cys Ser Lys Al - #a Gly Ser Asn Met Leu     785                 7 - #90                 7 - #95                 8 -     #00     - CTG ATC GGG GTC CAT GGG CCC ACC ACC CCC TG - #C GAG GAG GTC TCC ATG     2448     Leu Ile Gly Val His Gly Pro Thr Thr Pro Cy - #s Glu Glu Val Ser Met     #               815     - AAG CAT GTA GGC AAC CAG CAA TAC AAC GTC AC - #A TAC GTC GTC AAG GAG     2496     Lys His Val Gly Asn Gln Gln Tyr Asn Val Th - #r Tyr Val Val Lys Glu     #           830     - AGG GGC GAT TAT GTG CTG GCT GTG AAG TGG GG - #G GAG GAA CAC ATC CCT     2544     Arg Gly Asp Tyr Val Leu Ala Val Lys Trp Gl - #y Glu Glu His Ile Pro     #       845     #         2574     AT GTC ACA GTG CCT TAA     Gly Ser Pro Phe His Val Thr Val Pro Tyr     #   855     - (2) INFORMATION FOR SEQ ID NO:2:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 858 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:     - Pro Phe Asp Leu Val Ile Pro Phe Ala Val Ar - #g Lys Gly Glu Ile Thr     #                 15     - Gly Glu Val His Met Pro Ser Gly Lys Thr Al - #a Thr Pro Glu Ile Val     #             30     - Asp Asn Lys Asp Gly Thr Val Thr Val Arg Ty - #r Ala Pro Thr Glu Val     #         45     - Gly Leu His Glu Met His Ile Lys Tyr Met Gl - #y Ser His Ile Pro Glu     #     60     - Ser Pro Leu Gln Phe Tyr Val Asn Tyr Pro As - #n Ser Gly Ser Val Ser     # 80     - Ala Tyr Gly Pro Gly Leu Val Tyr Gly Val Al - #a Asn Lys Thr Ala Thr     #                 95     - Phe Thr Ile Val Thr Glu Asp Ala Gly Glu Gl - #y Gly Leu Asp Leu Ala     #           110     - Ile Glu Gly Pro Ser Lys Ala Glu Ile Ser Cy - #s Ile Asp Asn Lys Asp     #       125     - Gly Thr Cys Thr Val Thr Tyr Leu Pro Thr Le - #u Pro Gly Asp Tyr Ser     #   140     - Ile Leu Val Lys Tyr Asn Asp Lys His Ile Pr - #o Gly Ser Pro Phe Thr     145                 1 - #50                 1 - #55                 1 -     #60     - Ala Lys Ile Thr Asp Asp Ser Arg Arg Cys Se - #r Gln Val Lys Leu Gly     #               175     - Ser Ala Ala Asp Phe Leu Leu Asp Ile Ser Gl - #u Thr Asp Leu Ser Ser     #           190     - Leu Thr Ala Ser Ile Lys Ala Pro Ser Gly Ar - #g Asp Glu Pro Cys Leu     #       205     - Leu Lys Arg Leu Pro Asn Asn His Ile Gly Il - #e Ser Phe Ile Pro Arg     #   220     - Glu Val Gly Glu His Leu Val Ser Ile Lys Ly - #s Asn Gly Asn His Val     225                 2 - #30                 2 - #35                 2 -     #40     - Ala Asn Ser Pro Val Ser Ile Met Val Val Gl - #n Ser Glu Ile Gly Asp     #               255     - Ala Arg Arg Ala Lys Val Tyr Gly Arg Gly Le - #u Ser Glu Gly Arg Thr     #           270     - Phe Glu Met Ser Asp Phe Ile Val Asp Thr Ar - #g Asp Ala Gly Tyr Gly     #       285     - Gly Ile Ser Leu Ala Val Glu Gly Pro Ser Ly - #s Val Asp Ile Gln Thr     #   300     - Glu Asp Leu Glu Asp Gly Thr Cys Lys Val Se - #r Tyr Phe Pro Thr Val     305                 3 - #10                 3 - #15                 3 -     #20     - Pro Gly Val Tyr Ile Val Ser Thr Lys Phe Al - #a Asp Glu His Val Pro     #               335     - Gly Ser Pro Phe Thr Val Lys Ile Ser Gly Gl - #u Gly Arg Val Lys Glu     #           350     - Ser Ile Thr Arg Thr Ser Arg Ala Pro Ser Va - #l Ala Thr Val Gly Ser     #       365     - Ile Cys Asp Leu Asn Leu Lys Ile Pro Glu Il - #e Asn Ser Ser Asp Met     #   380     - Ser Ala His Val Thr Ser Pro Ser Gly Arg Va - #l Thr Glu Ala Glu Ile     385                 3 - #90                 3 - #95                 4 -     #00     - Val Pro Met Gly Lys Asn Ser His Cys Val Ar - #g Phe Val Pro Gln Glu     #               415     - Met Gly Val His Thr Val Ser Val Lys Tyr Ar - #g Gly Gln His Val Thr     #           430     - Gly Ser Pro Phe Gln Phe Thr Val Gly Ala Le - #u Gly Glu Gly Gly Ala     #       445     - His Lys Val Arg Ala Gly Gly Pro Gly Leu Gl - #u Arg Gly Glu Ala Gly     #   460     - Val Pro Ala Glu Phe Ser Ile Trp Thr Arg Gl - #u Ala Gly Ala Gly Gly     465                 4 - #70                 4 - #75                 4 -     #80     - Leu Ser Ile Ala Val Glu Gly Pro Ser Lys Al - #a Glu Ile Thr Phe Asp     #               495     - Asp His Lys Asn Gly Ser Cys Gly Val Ser Ty - #r Ile Ala Gln Glu Pro     #           510     - Gly Asn Tyr Glu Val Ser Ile Lys Phe Asn As - #p Glu His Ile Pro Glu     #       525     - Ser Pro Tyr Leu Val Pro Val Ile Ala Pro Se - #r Asp Asp Ala Arg Arg     #   540     - Leu Thr Val Met Ser Leu Gln Glu Ser Gly Le - #u Lys Val Asn Gln Pro     545                 5 - #50                 5 - #55                 5 -     #60     - Ala Ser Phe Ala Ile Arg Leu Asn Gly Ala Ly - #s Gly Lys Ile Asp Ala     #               575     - Lys Val His Ser Pro Ser Gly Ala Val Glu Gl - #u Cys His Val Ser Glu     #           590     - Leu Glu Pro Asp Lys Tyr Ala Val Arg Phe Il - #e Pro His Glu Asn Gly     #       605     - Val His Thr Ile Asp Val Lys Phe Asn Gly Se - #r His Val Val Gly Ser     #   620     - Pro Phe Lys Val Arg Val Gly Glu Pro Gly Gl - #n Ala Gly Asn Pro Ala     625                 6 - #30                 6 - #35                 6 -     #40     - Leu Val Ser Ala Tyr Gly Thr Gly Leu Glu Gl - #y Xaa Thr Thr Gly Ile     #               655     - Gln Ser Glu Phe Phe Ile Asn Thr Thr Arg Al - #a Gly Pro Gly Thr Leu     #           670     - Ser Val Thr Ile Glu Gly Pro Ser Lys Val Ly - #s Met Asp Cys Gln Glu     #       685     - Thr Pro Glu Gly Tyr Lys Val Met Tyr Thr Pr - #o Met Ala Pro Gly Asn     #   700     - Tyr Leu Ile Ser Val Lys Tyr Gly Gly Pro As - #n His Ile Val Gly Ser     705                 7 - #10                 7 - #15                 7 -     #20     - Pro Phe Lys Ala Lys Val Thr Gly Gln Arg Le - #u Val Ser Pro Gly Ser     #               735     - Ala Asn Glu Thr Ser Ser Ile Leu Val Glu Se - #r Val Thr Arg Ser Ser     #           750     - Thr Glu Thr Cys Tyr Ser Ala Ile Pro Lys Al - #a Ser Ser Asp Ala Ser     #       765     - Lys Val Thr Ser Lys Gly Ala Gly Leu Ser Ly - #s Ala Phe Val Gly Gln     #   780     - Lys Ser Ser Phe Leu Val Asp Cys Ser Lys Al - #a Gly Ser Asn Met Leu     785                 7 - #90                 7 - #95                 8 -     #00     - Leu Ile Gly Val His Gly Pro Thr Thr Pro Cy - #s Glu Glu Val Ser Met     #               815     - Lys His Val Gly Asn Gln Gln Tyr Asn Val Th - #r Tyr Val Val Lys Glu     #           830     - Arg Gly Asp Tyr Val Leu Ala Val Lys Trp Gl - #y Glu Glu His Ile Pro     #       845     - Gly Ser Pro Phe His Val Thr Val Pro Tyr     #   855     - (2) INFORMATION FOR SEQ ID NO:3:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 30 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:     #           30     GGCT CTCGGTGAAG     - (2) INFORMATION FOR SEQ ID NO:4:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 30 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:     #           30     GGGA CTGTCTGCCT     - (2) INFORMATION FOR SEQ ID NO:5:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 545 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 1..534     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:     - AAG GTG AAG ATG GAT TGC CAG GAG TGC CCT GA - #G GGC TAC CGC GTC ACC       48     Lys Val Lys Met Asp Cys Gln Glu Cys Pro Gl - #u Gly Tyr Arg Val Thr     #                 15     - TAT ACC CCC ATG GCA CCT GGC AGC TAC CTC AT - #C TCC ATC AAG TAC GGC       96     Tyr Thr Pro Met Ala Pro Gly Ser Tyr Leu Il - #e Ser Ile Lys Tyr Gly     #             30     - GGC CCC TAC CAC ATT GGG GGC AGC CCC TTC AA - #G GCC AAA GTC ACA GGC      144     Gly Pro Tyr His Ile Gly Gly Ser Pro Phe Ly - #s Ala Lys Val Thr Gly     #         45     - CCC CGT CTC GTC AGC AAC CAC AGC CTC CAC GA - #G ACA TCA TCA GTG TTT      192     Pro Arg Leu Val Ser Asn His Ser Leu His Gl - #u Thr Ser Ser Val Phe     #     60     - GTA GAC TCT CTG ACC AAG GCC ACC TGT GCC CC - #C CAG CAT GGG GCC CCG      240     Val Asp Ser Leu Thr Lys Ala Thr Cys Ala Pr - #o Gln His Gly Ala Pro     # 80     - GGT CCT GGG CCT GCT GAC GCC AGC AAG GTG GT - #G GCC AAG GGC CTG GGG      288     Gly Pro Gly Pro Ala Asp Ala Ser Lys Val Va - #l Ala Lys Gly Leu Gly     #                 95     - CTG AGC AAG GCC TAC GTA GGC CAG AAG AGC AG - #C TTC ACA GTA GAC TGC      336     Leu Ser Lys Ala Tyr Val Gly Gln Lys Ser Se - #r Phe Thr Val Asp Cys     #           110     - AGC AAA GCA GGC AAC AAC ATG CTG CTG GTG GG - #G GTT CAT GGC CCA AGG      384     Ser Lys Ala Gly Asn Asn Met Leu Leu Val Gl - #y Val His Gly Pro Arg     #       125     - ACC CCC TGC GAG GAG ATC CTG GTG AAG CAC GT - #G GGC AGC CGG CTC TAC      432     Thr Pro Cys Glu Glu Ile Leu Val Lys His Va - #l Gly Ser Arg Leu Tyr     #   140     - AGC GTG TCC TAC CTG CTC AAG GAC AAG GGG GA - #G TAC ACA CTG GTG GTC      480     Ser Val Ser Tyr Leu Leu Lys Asp Lys Gly Gl - #u Tyr Thr Leu Val Val     145                 1 - #50                 1 - #55                 1 -     #60     - AAA TGG GGG GAC GAG CAC ATC CCA GGC AGN CC - #C TAC CGN GTT GTG GTG      528     Lys Trp Gly Asp Glu His Ile Pro Gly Xaa Pr - #o Tyr Xaa Val Val Val     #               175     #  545             CC     Pro     - (2) INFORMATION FOR SEQ ID NO:6:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 177 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:     - Lys Val Lys Met Asp Cys Gln Glu Cys Pro Gl - #u Gly Tyr Arg Val Thr     #                 15     - Tyr Thr Pro Met Ala Pro Gly Ser Tyr Leu Il - #e Ser Ile Lys Tyr Gly     #             30     - Gly Pro Tyr His Ile Gly Gly Ser Pro Phe Ly - #s Ala Lys Val Thr Gly     #         45     - Pro Arg Leu Val Ser Asn His Ser Leu His Gl - #u Thr Ser Ser Val Phe     #     60     - Val Asp Ser Leu Thr Lys Ala Thr Cys Ala Pr - #o Gln His Gly Ala Pro     # 80     - Gly Pro Gly Pro Ala Asp Ala Ser Lys Val Va - #l Ala Lys Gly Leu Gly     #                 95     - Leu Ser Lys Ala Tyr Val Gly Gln Lys Ser Se - #r Phe Thr Val Asp Cys     #           110     - Ser Lys Ala Gly Asn Asn Met Leu Leu Val Gl - #y Val His Gly Pro Arg     #       125     - Thr Pro Cys Glu Glu Ile Leu Val Lys His Va - #l Gly Ser Arg Leu Tyr     #   140     - Ser Val Ser Tyr Leu Leu Lys Asp Lys Gly Gl - #u Tyr Thr Leu Val Val     145                 1 - #50                 1 - #55                 1 -     #60     - Lys Trp Gly Asp Glu His Ile Pro Gly Xaa Pr - #o Tyr Xaa Val Val Val     #               175     - Pro     - (2) INFORMATION FOR SEQ ID NO:7:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 8367 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 172..8115     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:     - CGATCCGGGC GCCACCCCGC GGTCATCGGT CACCGGTCGC TCTCAGGAAC AG - #CAGCGCAA       60     - CCTCTGCTCC CTGCCTCGCC TCCCGCGCGC CTAGGTGCCT GCGACTTTAA TT - #AAAGGGCC      120     #ATG AGT     177GGCTGCA GCACCGCCCC CCCGGCTTCT CGCGCCTCAA A     #   Met Ser     #     1     - AGC TCC CAC TCT CGG GCG GGC CAG AGC GCA GC - #A GGC GCG GCT CCG GGC      225     Ser Ser His Ser Arg Ala Gly Gln Ser Ala Al - #a Gly Ala Ala Pro Gly     #          15     - GGC GGC GTC GAC ACG CGG GAC GCC GAG ATG CC - #G GCC ACC GAG AAG GAC      273     Gly Gly Val Asp Thr Arg Asp Ala Glu Met Pr - #o Ala Thr Glu Lys Asp     #     30     - CTG GCG GAG GAC GCG CCG TGG AAG AAG ATC CA - #G CAG AAC ACT TTC ACG      321     Leu Ala Glu Asp Ala Pro Trp Lys Lys Ile Gl - #n Gln Asn Thr Phe Thr     # 50     - CGC TGG TGC AAC GAG CAC CTG AAG TGC GTG AG - #C AAG CGC ATC GCC AAC      369     Arg Trp Cys Asn Glu His Leu Lys Cys Val Se - #r Lys Arg Ile Ala Asn     #                 65     - CTG CAG ACG GAC CTG AGC GAC GGG CTG CGG CT - #T ATC GCG CTG TTG GAG      417     Leu Gln Thr Asp Leu Ser Asp Gly Leu Arg Le - #u Ile Ala Leu Leu Glu     #             80     - GTG CTC AGC CAG AAG AAG ATG CAC CGC AAG CA - #C AAC CAG CGG CCC ACT      465     Val Leu Ser Gln Lys Lys Met His Arg Lys Hi - #s Asn Gln Arg Pro Thr     #         95     - TTC CGC CAA ATG CAG CTT GAG AAC GTG TCG GT - #G GCG CTC GAG TTC CTG      513     Phe Arg Gln Met Gln Leu Glu Asn Val Ser Va - #l Ala Leu Glu Phe Leu     #   110     - GAC CGC GAG AGC ATC AAA CTG GTG TCC ATC GA - #C AGC AAG GCC ATC GTG      561     Asp Arg Glu Ser Ile Lys Leu Val Ser Ile As - #p Ser Lys Ala Ile Val     115                 1 - #20                 1 - #25                 1 -     #30     - GAC GGG AAC CTG AAG CTG ATC CTG GGC CTC AT - #C TGG ACC CTG ATC CTG      609     Asp Gly Asn Leu Lys Leu Ile Leu Gly Leu Il - #e Trp Thr Leu Ile Leu     #               145     - CAC TAC TCC ATC TCC ATG CCC ATG TGG GAC GA - #G GAG GAG GAT GAG GAG      657     His Tyr Ser Ile Ser Met Pro Met Trp Asp Gl - #u Glu Glu Asp Glu Glu     #           160     - GCC AAG AAG CAG ACC CCC AAG CAG AGG CTC CT - #G GGC TGG ATC CAG AAC      705     Ala Lys Lys Gln Thr Pro Lys Gln Arg Leu Le - #u Gly Trp Ile Gln Asn     #       175     - AAG CTG CCG CAG CTG CCC ATC ACC AAC TTC AG - #C CGG GAC TGG CAG AGC      753     Lys Leu Pro Gln Leu Pro Ile Thr Asn Phe Se - #r Arg Asp Trp Gln Ser     #   190     - GGC CGG GCC CTG GGC GCC CTG GTG GAC AGC TG - #T GCC CCG GGC CTG TGT      801     Gly Arg Ala Leu Gly Ala Leu Val Asp Ser Cy - #s Ala Pro Gly Leu Cys     195                 2 - #00                 2 - #05                 2 -     #10     - CCT GAC TGG GAC TCT TGG GAC GCC AGC AAG CC - #C GTT ACC AAT GCG CGA      849     Pro Asp Trp Asp Ser Trp Asp Ala Ser Lys Pr - #o Val Thr Asn Ala Arg     #               225     - GAG GCC ATG CAG CAG GCG GAT GAC TGG CTG GG - #C ATC CCC CAG GTG ATC      897     Glu Ala Met Gln Gln Ala Asp Asp Trp Leu Gl - #y Ile Pro Gln Val Ile     #           240     - ACC CCC GAG GAG ATT GTG GAC CCC AAC GTG GA - #C GAG CAC TCT GTC ATG      945     Thr Pro Glu Glu Ile Val Asp Pro Asn Val As - #p Glu His Ser Val Met     #       255     - ACC TAC CTG TCC CAG TTC CCC AAG GCC AAG CT - #G AAG CCA GGG GCT CCC      993     Thr Tyr Leu Ser Gln Phe Pro Lys Ala Lys Le - #u Lys Pro Gly Ala Pro     #   270     - TTG CGC CCC AAA CTG AAC CCG AAG AAA GCC CG - #T GCC TAC GGG CCA GGC     1041     Leu Arg Pro Lys Leu Asn Pro Lys Lys Ala Ar - #g Ala Tyr Gly Pro Gly     275                 2 - #80                 2 - #85                 2 -     #90     - ATC GAG CCC ACA GGC AAC ATG GTG AAG AAG CG - #G GCA GAG TTC ACT GTG     1089     Ile Glu Pro Thr Gly Asn Met Val Lys Lys Ar - #g Ala Glu Phe Thr Val     #               305     - GAG ACC AGA AGT GCT GGC CAG GGA GAG GTG CT - #G GTG TAC GTG GAG GAC     1137     Glu Thr Arg Ser Ala Gly Gln Gly Glu Val Le - #u Val Tyr Val Glu Asp     #           320     - CCG GCC GGA CAC CAG GAG GAG GCA AAA GTG AC - #C GCC AAT AAC GAC AAG     1185     Pro Ala Gly His Gln Glu Glu Ala Lys Val Th - #r Ala Asn Asn Asp Lys     #       335     - AAC CGC ACC TTC TCC GTC TGG TAC GTC CCC GA - #G GTG ACG GGG ACT CAT     1233     Asn Arg Thr Phe Ser Val Trp Tyr Val Pro Gl - #u Val Thr Gly Thr His     #   350     - AAG GTT ACT GTG CTC TTT GCT GGC CAG CAC AT - #C GCC AAG AGC CCC TTC     1281     Lys Val Thr Val Leu Phe Ala Gly Gln His Il - #e Ala Lys Ser Pro Phe     355                 3 - #60                 3 - #65                 3 -     #70     - GAG GTG TAC GTG GAT AAG TCA CAG GGT GAC GC - #C AGC AAA GTG ACA GCC     1329     Glu Val Tyr Val Asp Lys Ser Gln Gly Asp Al - #a Ser Lys Val Thr Ala     #               385     - CAA GGT CCC GGC CTG GAG CCC AGT GGC AAC AT - #C GCC AAC AAG ACC ACC     1377     Gln Gly Pro Gly Leu Glu Pro Ser Gly Asn Il - #e Ala Asn Lys Thr Thr     #           400     - TAC TTT GAG ATC TTT ACG GCA GGA GCT GGC AC - #G GGC GAG GTC GAG GTT     1425     Tyr Phe Glu Ile Phe Thr Ala Gly Ala Gly Th - #r Gly Glu Val Glu Val     #       415     - GTG ATC CAG GAC CCC ATG GGA CAG AAG GGC AC - #G GTA GAG CCT CAG CTG     1473     Val Ile Gln Asp Pro Met Gly Gln Lys Gly Th - #r Val Glu Pro Gln Leu     #   430     - GAG GCC CGG GGC GAC AGC ACA TAC CGC TGC AG - #C TAC CAG CCC ACC ATG     1521     Glu Ala Arg Gly Asp Ser Thr Tyr Arg Cys Se - #r Tyr Gln Pro Thr Met     435                 4 - #40                 4 - #45                 4 -     #50     - GAG GGC GTC CAC ACC GTG CAC GTC ACG TTT GC - #C GGC GTG CCC ATC CCT     1569     Glu Gly Val His Thr Val His Val Thr Phe Al - #a Gly Val Pro Ile Pro     #               465     - CGC AGC CCC TAC ACT GTC ACT GTT GGC CAA GC - #C TGT AAC CCG AGT GCC     1617     Arg Ser Pro Tyr Thr Val Thr Val Gly Gln Al - #a Cys Asn Pro Ser Ala     #           480     - TGC CGG GCG GTT GGC CGG GGC CTC CAG CCC AA - #G GGT GTG CGG GTG AAG     1665     Cys Arg Ala Val Gly Arg Gly Leu Gln Pro Ly - #s Gly Val Arg Val Lys     #       495     - GAG ACA GCT GAC TTC AAG GTG TAC ACA AAG GG - #C GCT GGC AGT GGG GAG     1713     Glu Thr Ala Asp Phe Lys Val Tyr Thr Lys Gl - #y Ala Gly Ser Gly Glu     #   510     - CTG AAG GTC ACC GTG AAG GGC CCC AAG GGA GA - #G GAG CGC GTG AAG CAG     1761     Leu Lys Val Thr Val Lys Gly Pro Lys Gly Gl - #u Glu Arg Val Lys Gln     515                 5 - #20                 5 - #25                 5 -     #30     - AAG GAC CTG GGG GAT GGC GTG TAT GGC TTC GA - #G TAT TAC CCC ATG GTC     1809     Lys Asp Leu Gly Asp Gly Val Tyr Gly Phe Gl - #u Tyr Tyr Pro Met Val     #               545     - CCT GGA ACC TAT ATC GTC ACC ATC ACG TGG GG - #T GGT CAG AAC ATC GGG     1857     Pro Gly Thr Tyr Ile Val Thr Ile Thr Trp Gl - #y Gly Gln Asn Ile Gly     #           560     - CGC AGT CCC TTC GAA GTG AAG GTG GGC ACC GA - #G TGT GGC AAT CAG AAG     1905     Arg Ser Pro Phe Glu Val Lys Val Gly Thr Gl - #u Cys Gly Asn Gln Lys     #       575     - GTA CGG GCC TGG GGC CCT GGG CTG GAG GGC GG - #C GTC GTT GGC AAG TCA     1953     Val Arg Ala Trp Gly Pro Gly Leu Glu Gly Gl - #y Val Val Gly Lys Ser     #   590     - GCA GAC TTT GTG GTG GAG GCT ATC GGG GAC GA - #C GTG GGC ACG CTG GGC     2001     Ala Asp Phe Val Val Glu Ala Ile Gly Asp As - #p Val Gly Thr Leu Gly     595                 6 - #00                 6 - #05                 6 -     #10     - TTC TCG GTG GAA GGG CCA TCG CAG GCT AAG AT - #C GAA TGT GAC GAC AAG     2049     Phe Ser Val Glu Gly Pro Ser Gln Ala Lys Il - #e Glu Cys Asp Asp Lys     #               625     - GGC GAC GGC TCC TGT GAT GTG CGC TAC TGG CC - #G CAG GAG GCT GGC GAG     2097     Gly Asp Gly Ser Cys Asp Val Arg Tyr Trp Pr - #o Gln Glu Ala Gly Glu     #           640     - TAT GCC GTT CAC GTG CTG TGC AAC AGC GAA GA - #C ATC CGC CTC AGC CCC     2145     Tyr Ala Val His Val Leu Cys Asn Ser Glu As - #p Ile Arg Leu Ser Pro     #       655     - TTC ATG GCT GAC ATC CGT GAC GCG CCC CAG GA - #C TTC CAC CCA GAC AGG     2193     Phe Met Ala Asp Ile Arg Asp Ala Pro Gln As - #p Phe His Pro Asp Arg     #   670     - GTG AAG GCA CGT GGG CCT GGA TTG GAG AAG AC - #A GGT GTG GCC GTC AAC     2241     Val Lys Ala Arg Gly Pro Gly Leu Glu Lys Th - #r Gly Val Ala Val Asn     675                 6 - #80                 6 - #85                 6 -     #90     - AAG CCA GCA GAG TTC ACA GTG GAT GCC AAG CA - #C GGT GGC AAG GCC CCA     2289     Lys Pro Ala Glu Phe Thr Val Asp Ala Lys Hi - #s Gly Gly Lys Ala Pro     #               705     - CTT CGG GTC CAA GTC CAG GAC AAT GAA GGC TG - #C CCT GTG GAG GCG TTG     2337     Leu Arg Val Gln Val Gln Asp Asn Glu Gly Cy - #s Pro Val Glu Ala Leu     #           720     - GTC AAG GAC AAC GGC AAT GGC ACT TAC AGC TG - #C TCC TAC GTG CCC AGG     2385     Val Lys Asp Asn Gly Asn Gly Thr Tyr Ser Cy - #s Ser Tyr Val Pro Arg     #       735     - AAG CCG GTG AAG CAC ACA GCC ATG GTG TCC TG - #G GGA GGC GTC AGC ATC     2433     Lys Pro Val Lys His Thr Ala Met Val Ser Tr - #p Gly Gly Val Ser Ile     #   750     - CCC AAC AGC CCC TTC AGG GTG AAT GTG GGA GC - #T GGC AGC CAC CCC AAC     2481     Pro Asn Ser Pro Phe Arg Val Asn Val Gly Al - #a Gly Ser His Pro Asn     755                 7 - #60                 7 - #65                 7 -     #70     - AAG GTC AAA GTA TAC GGC CCC GGA GTA GCC AA - #G ACA GGG CTC AAG GCC     2529     Lys Val Lys Val Tyr Gly Pro Gly Val Ala Ly - #s Thr Gly Leu Lys Ala     #               785     - CAC GAG CCC ACC TAC TTC ACT GTG GAC TGC GC - #C GAG GCT GGC CAG GGG     2577     His Glu Pro Thr Tyr Phe Thr Val Asp Cys Al - #a Glu Ala Gly Gln Gly     #           800     - GAC GTC AGC ATC GGC ATC AAG TGT GCC CCT GG - #A GTG GTA GGC CCC GCC     2625     Asp Val Ser Ile Gly Ile Lys Cys Ala Pro Gl - #y Val Val Gly Pro Ala     #       815     - GAA GCT GAC ATC GAC TTC GAC ATC ATC CGC AA - #T GAC AAT GAC ACC TTC     2673     Glu Ala Asp Ile Asp Phe Asp Ile Ile Arg As - #n Asp Asn Asp Thr Phe     #   830     - ACG GTC AAG TAC ACG CCC CGG GGG GCT GGC AG - #C TAC ACC ATT ATG GTC     2721     Thr Val Lys Tyr Thr Pro Arg Gly Ala Gly Se - #r Tyr Thr Ile Met Val     835                 8 - #40                 8 - #45                 8 -     #50     - CTC TTT GCT GAC CAG GCC ACG CCC ACC AGC CC - #C ATC CGA GTC AAG GTG     2769     Leu Phe Ala Asp Gln Ala Thr Pro Thr Ser Pr - #o Ile Arg Val Lys Val     #               865     - GAG CCC TCT CAT GAC GCC AGT AAG GTG AAG GC - #C GAG GGC CCT GGC CTC     2817     Glu Pro Ser His Asp Ala Ser Lys Val Lys Al - #a Glu Gly Pro Gly Leu     #           880     - AGT CGC ACT GGT GTC GAG CTT GGC AAG CCC AC - #C CAC TTC ACA GTA AAT     2865     Ser Arg Thr Gly Val Glu Leu Gly Lys Pro Th - #r His Phe Thr Val Asn     #       895     - GCC AAA GCT GCT GGC AAA GGC AAG CTG GAC GT - #C CAG TTC TCA GGA CTC     2913     Ala Lys Ala Ala Gly Lys Gly Lys Leu Asp Va - #l Gln Phe Ser Gly Leu     #   910     - ACC AAG GGG GAT GCA GTG CGA GAT GTG GAC AT - #C ATC GAC CAC CAT GAC     2961     Thr Lys Gly Asp Ala Val Arg Asp Val Asp Il - #e Ile Asp His His Asp     915                 9 - #20                 9 - #25                 9 -     #30     - AAC ACC TAC ACA GTC AAG TAC ACG CCT GTC CA - #G CAG GGT CCA GTA GGC     3009     Asn Thr Tyr Thr Val Lys Tyr Thr Pro Val Gl - #n Gln Gly Pro Val Gly     #               945     - GTC AAT GTC ACT TAT GGA GGG GAT CCC ATC CC - #T AAG AGC CCT TTC TCA     3057     Val Asn Val Thr Tyr Gly Gly Asp Pro Ile Pr - #o Lys Ser Pro Phe Ser     #           960     - GTG GCA GTA TCT CCA AGC CTG GAC CTC AGC AA - #G ATC AAG GTG TCT GGC     3105     Val Ala Val Ser Pro Ser Leu Asp Leu Ser Ly - #s Ile Lys Val Ser Gly     #       975     - CTG GGA GAG AAG GTG GAC GTT GGC AAA GAC CA - #G GAG TTC ACA GTC AAA     3153     Leu Gly Glu Lys Val Asp Val Gly Lys Asp Gl - #n Glu Phe Thr Val Lys     #   990     - TCA AAG GGT GCT GGT GGT CAA GGC AAA GTG GC - #A TCC AAG ATT GTG GGC     3201     Ser Lys Gly Ala Gly Gly Gln Gly Lys Val Al - #a Ser Lys Ile Val Gly     995                 1 - #000                1005 - #                1010     - CCC TCG GGT GCA GCG GTG CCC TGC AAG GTG GA - #G CCA GGC CTG GGG GCT     3249     Pro Ser Gly Ala Ala Val Pro Cys Lys Val Gl - #u Pro Gly Leu Gly Ala     #              10250     - GAC AAC AGT GTG GTG CGC TTC CTG CCC CGT GA - #G GAA GGG CCC TAT GAG     3297     Asp Asn Ser Val Val Arg Phe Leu Pro Arg Gl - #u Glu Gly Pro Tyr Glu     #          10405     - GTG GAG GTG ACC TAT GAC GGC GTG CCC GTG CC - #T GGC AGC CCC TTT CCT     3345     Val Glu Val Thr Tyr Asp Gly Val Pro Val Pr - #o Gly Ser Pro Phe Pro     #      10550     - CTG GAA GCT GTG GCC CCC ACC AAG CCT AGC AA - #G GTG AAG GCG TTT GGG     3393     Leu Glu Ala Val Ala Pro Thr Lys Pro Ser Ly - #s Val Lys Ala Phe Gly     #  10705     - CCG GGG CTG CAG GGA GGC AGT GCG GGC TCC CC - #C GCC CGC TTC ACC ATC     3441     Pro Gly Leu Gln Gly Gly Ser Ala Gly Ser Pr - #o Ala Arg Phe Thr Ile     #               10901080 - #                1085     - GAC ACC AAG GGC GCC GGC ACA GGT GGC CTG GG - #C CTG ACG GTG GAG GGC     3489     Asp Thr Lys Gly Ala Gly Thr Gly Gly Leu Gl - #y Leu Thr Val Glu Gly     #              11050     - CCC TGT GAG GCG CAG CTC GAG TGC TTG GAC AA - #T GGG GAT GGC ACA TGT     3537     Pro Cys Glu Ala Gln Leu Glu Cys Leu Asp As - #n Gly Asp Gly Thr Cys     #          11205     - TCC GTG TCC TAC GTG CCC ACC GAG CCC GGG GA - #C TAC AAC ATC AAC ATC     3585     Ser Val Ser Tyr Val Pro Thr Glu Pro Gly As - #p Tyr Asn Ile Asn Ile     #      11350     - CTC TTC GCT GAC ACC CAC ATC CCT GGC TCC CC - #A TTC AAG GCC CAC GTG     3633     Leu Phe Ala Asp Thr His Ile Pro Gly Ser Pr - #o Phe Lys Ala His Val     #  11505     - GTT CCC TGC TTT GAC GCA TCC AAA GTC AAG TG - #C TCA GGC CCC GGG CTG     3681     Val Pro Cys Phe Asp Ala Ser Lys Val Lys Cy - #s Ser Gly Pro Gly Leu     #               11701160 - #                1165     - GAG CGG GCC ACC GCT GGG GAG GTG GGC CAA TT - #C CAA GTG GAC TGC TCG     3729     Glu Arg Ala Thr Ala Gly Glu Val Gly Gln Ph - #e Gln Val Asp Cys Ser     #              11850     - AGC GCG GGC AGC GCG GAG CTG ACC ATT GAG AT - #C TGC TCG GAG GCG GGG     3777     Ser Ala Gly Ser Ala Glu Leu Thr Ile Glu Il - #e Cys Ser Glu Ala Gly     #          12005     - CTT CCG GCC GAG GTG TAC ATC CAG GAC CAC GG - #T GAT GGC ACG CAC ACC     3825     Leu Pro Ala Glu Val Tyr Ile Gln Asp His Gl - #y Asp Gly Thr His Thr     #      12150     - ATT ACC TAC ATT CCC CTC TGC CCC GGG GCC TA - #C ACC GTC ACC ATC AAG     3873     Ile Thr Tyr Ile Pro Leu Cys Pro Gly Ala Ty - #r Thr Val Thr Ile Lys     #  12305     - TAC GGC GGC CAG CCC GTG CCC AAC TTC CCC AG - #C AAG CTG CAG GTG GAA     3921     Tyr Gly Gly Gln Pro Val Pro Asn Phe Pro Se - #r Lys Leu Gln Val Glu     #               12501240 - #                1245     - CCT GCG GTG GAC ACT TCC GGT GTC CAG TGC TA - #T GGG CCT GGT ATT GAG     3969     Pro Ala Val Asp Thr Ser Gly Val Gln Cys Ty - #r Gly Pro Gly Ile Glu     #              12650     - GGC CAG GGT GTC TTC CGT GAG GCC ACC ACT GA - #G TTC AGT GTG GAC GCC     4017     Gly Gln Gly Val Phe Arg Glu Ala Thr Thr Gl - #u Phe Ser Val Asp Ala     #          12805     - CGG GCT CTG ACA CAG ACC GGA GGG CCG CAC GT - #C AAG GCC CGT GTG GCC     4065     Arg Ala Leu Thr Gln Thr Gly Gly Pro His Va - #l Lys Ala Arg Val Ala     #      12950     - AAC CCC TCA GGC AAC CTG ACG GAG ACC TAC GT - #T CAG GAC CGT GGC GAT     4113     Asn Pro Ser Gly Asn Leu Thr Glu Thr Tyr Va - #l Gln Asp Arg Gly Asp     #  13105     - GGC ATG TAC AAA GTG GAG TAC ACG CCT TAC GA - #G GAG GGA CTG CAC TCC     4161     Gly Met Tyr Lys Val Glu Tyr Thr Pro Tyr Gl - #u Glu Gly Leu His Ser     #               13301320 - #                1325     - GTG GAC GTG ACC TAT GAC GGC AGT CCC GTG CC - #C AGC AGC CCC TTC CAG     4209     Val Asp Val Thr Tyr Asp Gly Ser Pro Val Pr - #o Ser Ser Pro Phe Gln     #              13450     - GTG CCC GTG ACC GAG GGC TGC GAC CCC TCC CG - #G GTG CGT GTC CAC GGG     4257     Val Pro Val Thr Glu Gly Cys Asp Pro Ser Ar - #g Val Arg Val His Gly     #          13605     - CCA GGC ATC CAA AGT GGC ACC ACC AAC AAG CC - #C AAC AAG TTC ACT GTG     4305     Pro Gly Ile Gln Ser Gly Thr Thr Asn Lys Pr - #o Asn Lys Phe Thr Val     #      13750     - GAG ACC AGG GGA GCT GGC ACG GGC GGC CTG GG - #C CTG GCT GTA GAG GGC     4353     Glu Thr Arg Gly Ala Gly Thr Gly Gly Leu Gl - #y Leu Ala Val Glu Gly     #  13905     - CCC TCC GAG GCC AAG ATG TCC TGC ATG GAT AA - #C AAG GAC GGC AGC TGC     4401     Pro Ser Glu Ala Lys Met Ser Cys Met Asp As - #n Lys Asp Gly Ser Cys     #               14101400 - #                1405     - TCG GTC GAG TAC ATC CCT TAT GAG GCT GGC AC - #C TAC AGC CTC AAC GTC     4449     Ser Val Glu Tyr Ile Pro Tyr Glu Ala Gly Th - #r Tyr Ser Leu Asn Val     #              14250     - ACC TAT GGT GGC CAT CAA GTG CCA GGC AGT CC - #T TTC AAG GTC CCT GTG     4497     Thr Tyr Gly Gly His Gln Val Pro Gly Ser Pr - #o Phe Lys Val Pro Val     #          14405     - CAT GAT GTG ACA GAT GCG TCC AAG GTC AAG TG - #C TCT GGG CCC GGC CTG     4545     His Asp Val Thr Asp Ala Ser Lys Val Lys Cy - #s Ser Gly Pro Gly Leu     #      14550     - AGC CCA GGC ATG GTT CGT GCC AAC CTC CCT CA - #G TCC TTC CAG GTG GAC     4593     Ser Pro Gly Met Val Arg Ala Asn Leu Pro Gl - #n Ser Phe Gln Val Asp     #  14705     - ACA AGC AAG GCT GGT GTG GCC CCA TTG CAG GT - #C AAA GTG CAA GGG CCC     4641     Thr Ser Lys Ala Gly Val Ala Pro Leu Gln Va - #l Lys Val Gln Gly Pro     #               14901480 - #                1485     - AAA GGC CTG GTG GAG CCA GTG GAC GTG GTA GA - #C AAC GCT GAT GGC ACC     4689     Lys Gly Leu Val Glu Pro Val Asp Val Val As - #p Asn Ala Asp Gly Thr     #              15050     - CAG ACC GTC AAT TAT GTG CCC AGC CGA GAA GG - #G CCC TAC AGC ATC TCA     4737     Gln Thr Val Asn Tyr Val Pro Ser Arg Glu Gl - #y Pro Tyr Ser Ile Ser     #          15205     - GTA CTG TAT GGA GAT GAA GAG GTA CCC CGG AG - #C CCC TTC AAG GTC AAG     4785     Val Leu Tyr Gly Asp Glu Glu Val Pro Arg Se - #r Pro Phe Lys Val Lys     #      15350     - GTG CTG CCT ACT CAT GAT GCC AGC AAG GTG AA - #G GCC AGT GGC CCC GGG     4833     Val Leu Pro Thr His Asp Ala Ser Lys Val Ly - #s Ala Ser Gly Pro Gly     #  15505     - CTC AAC ACC ACT GGC GTG CCT GCC AGC CTG CC - #C GTG GAG TTC ACC ATC     4881     Leu Asn Thr Thr Gly Val Pro Ala Ser Leu Pr - #o Val Glu Phe Thr Ile     #               15701560 - #                1565     - GAT GCA AAG GAC GCC GGG GAG GGC CTG CTG GC - #T GTC CAG ATC ACG GAT     4929     Asp Ala Lys Asp Ala Gly Glu Gly Leu Leu Al - #a Val Gln Ile Thr Asp     #              15850     - CCC GAA GGC AAG CCG AAG AAG ACA CAC ATC CA - #A GAC AAC CAT GAC GGC     4977     Pro Glu Gly Lys Pro Lys Lys Thr His Ile Gl - #n Asp Asn His Asp Gly     #          16005     - ACG TAT ACA GTG GCC TAC GTG CCA GAC GTG AC - #A GGT CGC TAC ACC ATC     5025     Thr Tyr Thr Val Ala Tyr Val Pro Asp Val Th - #r Gly Arg Tyr Thr Ile     #      16150     - CTC ATC AAG TAC GGT GGT GAC GAG ATC CCC TT - #C TCC CCG TAC CGC GTG     5073     Leu Ile Lys Tyr Gly Gly Asp Glu Ile Pro Ph - #e Ser Pro Tyr Arg Val     #  16305     - CGT GCC GTG CCC ACC GGG GAC GCC AGC AAG TG - #C ACT GTC ACA GTG TCA     5121     Arg Ala Val Pro Thr Gly Asp Ala Ser Lys Cy - #s Thr Val Thr Val Ser     #               16501640 - #                1645     - ATC GGA GGT CAC GGG CTA GGT GCT GGC ATC GG - #C CCC ACC ATT CAG ATT     5169     Ile Gly Gly His Gly Leu Gly Ala Gly Ile Gl - #y Pro Thr Ile Gln Ile     #              16650     - GGG GAG GAG ACG GTG ATC ACT GTG GAC ACT AA - #G GCG GCA GGC AAA GGC     5217     Gly Glu Glu Thr Val Ile Thr Val Asp Thr Ly - #s Ala Ala Gly Lys Gly     #          16805     - AAA GTG ACG TGC ACC GTG TGC ACG CCT GAT GG - #C TCA GAG GTG GAT GTG     5265     Lys Val Thr Cys Thr Val Cys Thr Pro Asp Gl - #y Ser Glu Val Asp Val     #      16950     - GAC GTG GTG GAG AAT GAG GAC GGC ACT TTC GA - #C ATC TTC TAC ACG GCC     5313     Asp Val Val Glu Asn Glu Asp Gly Thr Phe As - #p Ile Phe Tyr Thr Ala     #  17105     - CCC CAG CCG GGC AAA TAC GTC ATC TGT GTG CG - #C TTT GGT GGC GAG CAC     5361     Pro Gln Pro Gly Lys Tyr Val Ile Cys Val Ar - #g Phe Gly Gly Glu His     #               17301720 - #                1725     - GTG CCC AAC AGC CCC TTC CAA GTG ACG GCT CT - #G GCT GGG GAC CAG CCC     5409     Val Pro Asn Ser Pro Phe Gln Val Thr Ala Le - #u Ala Gly Asp Gln Pro     #              17450     - TCG GTG CAG CCC CCT CTA CGG TCT CAG CAG CT - #G GCC CCA CAG TAC ACC     5457     Ser Val Gln Pro Pro Leu Arg Ser Gln Gln Le - #u Ala Pro Gln Tyr Thr     #          17605     - TAC GCC CAG GGC GGC CAG CAG ACT TGG GCC CC - #G GAG AGG CCC CTG GTG     5505     Tyr Ala Gln Gly Gly Gln Gln Thr Trp Ala Pr - #o Glu Arg Pro Leu Val     #      17750     - GGT GTC AAT GGG CTG GAT GTG ACC AGC CTG AG - #G CCC TTT GAC CTT GTC     5553     Gly Val Asn Gly Leu Asp Val Thr Ser Leu Ar - #g Pro Phe Asp Leu Val     #  17905     - ATC CCC TTC ACC ATC AAG AAG GGC GAG ATC AC - #A GGG GAG GTT CGG ATG     5601     Ile Pro Phe Thr Ile Lys Lys Gly Glu Ile Th - #r Gly Glu Val Arg Met     #               18101800 - #                1805     - CCC TCA GGC AAG GTG GCG CAG CCC ACC ATC AC - #T GAC AAC AAA GAC GGC     5649     Pro Ser Gly Lys Val Ala Gln Pro Thr Ile Th - #r Asp Asn Lys Asp Gly     #              18250     - ACC GTG ACC GTG CGG TAT GCA CCC AGC GAG GC - #T GGC CTG CAC GAG ATG     5697     Thr Val Thr Val Arg Tyr Ala Pro Ser Glu Al - #a Gly Leu His Glu Met     #          18405     - GAC ATC CGC TAT GAC AAC ATG CAC ATC CCA GG - #A AGC CCC TTG CAG TTC     5745     Asp Ile Arg Tyr Asp Asn Met His Ile Pro Gl - #y Ser Pro Leu Gln Phe     #      18550     - TAT GTG GAT TAC GTC AAC TGT GGC CAT GTC AC - #T GCC TAT GGG CCT GGC     5793     Tyr Val Asp Tyr Val Asn Cys Gly His Val Th - #r Ala Tyr Gly Pro Gly     #  18705     - CTC ACC CAT GGA GTA GTG AAC AAG CCT GCC AC - #C TTC ACC GTC AAC ACC     5841     Leu Thr His Gly Val Val Asn Lys Pro Ala Th - #r Phe Thr Val Asn Thr     #               18901880 - #                1885     - AAG GAT GCA GGA GAG GGG GGC CTG TCT CTG GC - #C ATT GAG GGC CCG TCC     5889     Lys Asp Ala Gly Glu Gly Gly Leu Ser Leu Al - #a Ile Glu Gly Pro Ser     #              19050     - AAA GCA GAA ATC AGC TGC ACT GAC AAC CAG GA - #T GGG ACA TGC AGC GTG     5937     Lys Ala Glu Ile Ser Cys Thr Asp Asn Gln As - #p Gly Thr Cys Ser Val     #          19205     - TCC TAC CTG CCT GTG CTG CCG GGG GAC TAC AG - #C ATT CTA GTC AAG TAC     5985     Ser Tyr Leu Pro Val Leu Pro Gly Asp Tyr Se - #r Ile Leu Val Lys Tyr     #      19350     - AAT GAA CAG CAC GTC CCA GGC AGC CCC TTC AC - #T GCT CGG GTC ACA GGT     6033     Asn Glu Gln His Val Pro Gly Ser Pro Phe Th - #r Ala Arg Val Thr Gly     #  19505     - GAC GAC TCC ATG CGT ATG TCC CAC CTA AAG GT - #C GGC TCT GCT GCC GAC     6081     Asp Asp Ser Met Arg Met Ser His Leu Lys Va - #l Gly Ser Ala Ala Asp     #               19701960 - #                1965     - ATC CCC ATC AAC ATC TCA GAG ACG GAT CTC AG - #C CTG CTG ACG GCC ACT     6129     Ile Pro Ile Asn Ile Ser Glu Thr Asp Leu Se - #r Leu Leu Thr Ala Thr     #              19850     - GTG GTC CCG CCC TCG GGC CGG GAG GAG CCC TG - #T TTG CTG AAG CGG CTG     6177     Val Val Pro Pro Ser Gly Arg Glu Glu Pro Cy - #s Leu Leu Lys Arg Leu     #          20005     - CGT AAT GGC CAC GTG GGG ATT TCA TTC GTG CC - #C AAG GAG ACG GGG GAG     6225     Arg Asn Gly His Val Gly Ile Ser Phe Val Pr - #o Lys Glu Thr Gly Glu     #      20150     - CAC CTG GTG CAT GTG AAG AAA AAT GGC CAG CA - #C GTG GCC AGC AGC CCC     6273     His Leu Val His Val Lys Lys Asn Gly Gln Hi - #s Val Ala Ser Ser Pro     #  20305     - ATC CCG GTG GTG ATC AGC CAG TCG GAA ATT GG - #G GAT GCC AGT CGT GTT     6321     Ile Pro Val Val Ile Ser Gln Ser Glu Ile Gl - #y Asp Ala Ser Arg Val     #               20502040 - #                2045     - CGG GTC TCT GGT CAG GGC CTT CAC GAA GGC CA - #C ACC TTT GAG CCT GCA     6369     Arg Val Ser Gly Gln Gly Leu His Glu Gly Hi - #s Thr Phe Glu Pro Ala     #              20650     - GAG TTT ATC ATT GAT ACC CGC GAT GCA GGC TA - #T GGT GGG CTC AGC CTG     6417     Glu Phe Ile Ile Asp Thr Arg Asp Ala Gly Ty - #r Gly Gly Leu Ser Leu     #          20805     - TCC ATT GAG GGC CCC AGC AAG GTG GAC ATC AA - #C ACA GAG GAC CTG GAG     6465     Ser Ile Glu Gly Pro Ser Lys Val Asp Ile As - #n Thr Glu Asp Leu Glu     #      20950     - GAC GGG ACG TGC AGG GTC ACC TAC TGC CCC AC - #A GAG CCA GGC AAC TAC     6513     Asp Gly Thr Cys Arg Val Thr Tyr Cys Pro Th - #r Glu Pro Gly Asn Tyr     #  21105     - ATC ATC AAC ATC AAG TTT GCC GAC CAG CAC GT - #G CCT GGC AGC CCC TTC     6561     Ile Ile Asn Ile Lys Phe Ala Asp Gln His Va - #l Pro Gly Ser Pro Phe     #               21302120 - #                2125     - TCT GTG AAG GTG ACA GGC GAG GGC CGG GTG AA - #A GAG AGC ATC ACC CGC     6609     Ser Val Lys Val Thr Gly Glu Gly Arg Val Ly - #s Glu Ser Ile Thr Arg     #              21450     - AGG CGT CGG GCT CCT TCA GTG GCC AAC GTT GG - #T AGT CAT TGT GAC CTC     6657     Arg Arg Arg Ala Pro Ser Val Ala Asn Val Gl - #y Ser His Cys Asp Leu     #          21605     - AGC CTG AAA ATC CCT GAA ATT AGC ATC CAG GA - #T ATG ACA GCC CAG GTG     6705     Ser Leu Lys Ile Pro Glu Ile Ser Ile Gln As - #p Met Thr Ala Gln Val     #      21750     - ACC AGC CCA TCG GGC AAG ACC CAT GAG GCC GA - #G ATC GTG GAA GGG GAG     6753     Thr Ser Pro Ser Gly Lys Thr His Glu Ala Gl - #u Ile Val Glu Gly Glu     #  21905     - AAC CAC ACC TAC TGC ATC CGC TTT GTT CCC GC - #T GAG ATG GGC ACA CAC     6801     Asn His Thr Tyr Cys Ile Arg Phe Val Pro Al - #a Glu Met Gly Thr His     #               22102200 - #                2205     - ACA GTC AGC GTC AAG TAC AAG GGC CAG CAC GT - #G CCT GGG AGC CCC TTC     6849     Thr Val Ser Val Lys Tyr Lys Gly Gln His Va - #l Pro Gly Ser Pro Phe     #              22250     - CAG TTC ACC GTG GGG CCC CTA GGG GAA GGG GG - #A GCC CAC AAG GTC CGA     6897     Gln Phe Thr Val Gly Pro Leu Gly Glu Gly Gl - #y Ala His Lys Val Arg     #          22405     - GCT GGG GGC CCT GGC CTG GAG AGA GCT GAA GC - #T GGA GTG CCA GCC GAA     6945     Ala Gly Gly Pro Gly Leu Glu Arg Ala Glu Al - #a Gly Val Pro Ala Glu     #      22550     - TTC AGT ATC TGG ACC CGG GAA GCT GGT GCT GG - #A GGC CTG GCC ATT GCT     6993     Phe Ser Ile Trp Thr Arg Glu Ala Gly Ala Gl - #y Gly Leu Ala Ile Ala     #  22705     - GTC GAG GGC CCC AGC AAG GCT GAG ATC TCT TT - #T GAG GAC CGC AAG GAC     7041     Val Glu Gly Pro Ser Lys Ala Glu Ile Ser Ph - #e Glu Asp Arg Lys Asp     #               22902280 - #                2285     - GGC TCC TGT GGT GTG GCT TAT GTG GTC CAG GA - #G CCA GGT GAC TAC GAA     7089     Gly Ser Cys Gly Val Ala Tyr Val Val Gln Gl - #u Pro Gly Asp Tyr Glu     #              23050     - GTC TCA GTC AAG TTC AAC GAG GAA CAC ATT CC - #C GAC AGC CCC TTC GTG     7137     Val Ser Val Lys Phe Asn Glu Glu His Ile Pr - #o Asp Ser Pro Phe Val     #          23205     - GTG CCT GTG GCT TCT CCG TCT GGC GAC GCC CG - #C CGC CTC ACT GTT TCT     7185     Val Pro Val Ala Ser Pro Ser Gly Asp Ala Ar - #g Arg Leu Thr Val Ser     #      23350     - AGC CTT CAG GAG TCA GGG CTA AAG GTC AAC CA - #G CCA GCC TCT TTT GCA     7233     Ser Leu Gln Glu Ser Gly Leu Lys Val Asn Gl - #n Pro Ala Ser Phe Ala     #  23505     - GTC AGC CTG AAC GGG GCC AAG GGG GCG ATC GA - #T GCC AAG GTG CAC AGC     7281     Val Ser Leu Asn Gly Ala Lys Gly Ala Ile As - #p Ala Lys Val His Ser     #               23702360 - #                2365     - CCC TCA GGA GCC CTG GAG GAG TGC TAT GTC AC - #A GAA ATT GAC CAA GAT     7329     Pro Ser Gly Ala Leu Glu Glu Cys Tyr Val Th - #r Glu Ile Asp Gln Asp     #              23850     - AAG TAT GCT GTG CGC TTC ATC CCT CGG GAG AA - #T GGC GTT TAC CTG ATT     7377     Lys Tyr Ala Val Arg Phe Ile Pro Arg Glu As - #n Gly Val Tyr Leu Ile     #          24005     - GAC GTC AAG TTC AAC GGT ACC CAC ATC CCT GG - #A AGC CCC TTC AAG ATC     7425     Asp Val Lys Phe Asn Gly Thr His Ile Pro Gl - #y Ser Pro Phe Lys Ile     #      24150     - CGA GTT GGG GAG CCT GGG CAT GGA GGG GAC CC - #A GGC TTG GTG TCT GCT     7473     Arg Val Gly Glu Pro Gly His Gly Gly Asp Pr - #o Gly Leu Val Ser Ala     #  24305     - TAC GGA GCA GGT CTG GAA GGC GGT GTC ACA GG - #G AAC CCA GCT GAG TTC     7521     Tyr Gly Ala Gly Leu Glu Gly Gly Val Thr Gl - #y Asn Pro Ala Glu Phe     #               24502440 - #                2445     - GTC GTG AAC ACG AGC AAT GCG GGA GCT GGT GC - #C CTG TCG GTG ACC ATT     7569     Val Val Asn Thr Ser Asn Ala Gly Ala Gly Al - #a Leu Ser Val Thr Ile     #              24650     - GAC GGC CCC TCC AAG GTG AAG ATG GAT TGC CA - #G GAG TGC CCT GAG GGC     7617     Asp Gly Pro Ser Lys Val Lys Met Asp Cys Gl - #n Glu Cys Pro Glu Gly     #          24805     - TAC CGC GTC ACC TAT ACC CCC ATG GCA CCT GG - #C AGC TAC CTC ATC TCC     7665     Tyr Arg Val Thr Tyr Thr Pro Met Ala Pro Gl - #y Ser Tyr Leu Ile Ser     #      24950     - ATC AAG TAC GGC GGC CCC TAC CAC ATT GGG GG - #C AGC CCC TTC AAG GCC     7713     Ile Lys Tyr Gly Gly Pro Tyr His Ile Gly Gl - #y Ser Pro Phe Lys Ala     #  25105     - AAA GTC ACA GGC CCC CGT CTC GTC AGC AAC CA - #C AGC CTC CAC GAG ACA     7761     Lys Val Thr Gly Pro Arg Leu Val Ser Asn Hi - #s Ser Leu His Glu Thr     #               25302520 - #                2525     - TCA TCA GTG TTT GTA GAC TCT CTG ACC AAG GC - #C ACC TGT GCC CCC CAG     7809     Ser Ser Val Phe Val Asp Ser Leu Thr Lys Al - #a Thr Cys Ala Pro Gln     #              25450     - CAT GGG GCC CCG GGT CCT GGG CCT GCT GAC GC - #C AGC AAG GTG GTG GCC     7857     His Gly Ala Pro Gly Pro Gly Pro Ala Asp Al - #a Ser Lys Val Val Ala     #          25605     - AAG GGC CTG GGG CTG AGC AAG GCC TAC GTA GG - #C CAG AAG AGC AGC TTC     7905     Lys Gly Leu Gly Leu Ser Lys Ala Tyr Val Gl - #y Gln Lys Ser Ser Phe     #      25750     - ACA GTA GAC TGC AGC AAA GCA GGC AAC AAC AT - #G CTG CTG GTG GGG GTT     7953     Thr Val Asp Cys Ser Lys Ala Gly Asn Asn Me - #t Leu Leu Val Gly Val     #  25905     - CAT GGC CCA AGG ACC CCC TGC GAG GAG ATC CT - #G GTG AAG CAC GTG GGC     8001     His Gly Pro Arg Thr Pro Cys Glu Glu Ile Le - #u Val Lys His Val Gly     #               26102600 - #                2605     - AGC CGG CTC TAC AGC GTG TCC TAC CTG CTC AA - #G GAC AAG GGG GAG TAC     8049     Ser Arg Leu Tyr Ser Val Ser Tyr Leu Leu Ly - #s Asp Lys Gly Glu Tyr     #              26250     - ACA CTG GTG GTC AAA TGG GGG CAC GAG CAC AT - #C CCA GGC AGC CCC TAC     8097     Thr Leu Val Val Lys Trp Gly His Glu His Il - #e Pro Gly Ser Pro Tyr     #          26405     - CGC GTT GTG GTG CCC TGAGTCTGGG GCCCGTGCCA GCCGGCAGC - #C CCCAAGCCTG     8152     Arg Val Val Val Pro             2645     - CCCCGCTACC CAAGCAGCCC CGCCCTCTTC CCCTCAACCC CGGCCCAGGC CG - #CCCTGGCC     8212     - GCCCGCCTGT CACTGCAGCT GCCCCTGCCC TGTGCCGTGC TGCGCTCACC TG - #CCTCCCCA     8272     - GCCAGCCGCT GACCTCTCGG CTTTCACTTG GGCAGAGGGA GCCATTTGGT GG - #CGCTGCTT     8332     #     8367         GGAG GGGTGAGGGA TGGGG     - (2) INFORMATION FOR SEQ ID NO:8:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 2647 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:     - Met Ser Ser Ser His Ser Arg Ala Gly Gln Se - #r Ala Ala Gly Ala Ala     #                 15     - Pro Gly Gly Gly Val Asp Thr Arg Asp Ala Gl - #u Met Pro Ala Thr Glu     #             30     - Lys Asp Leu Ala Glu Asp Ala Pro Trp Lys Ly - #s Ile Gln Gln Asn Thr     #         45     - Phe Thr Arg Trp Cys Asn Glu His Leu Lys Cy - #s Val Ser Lys Arg Ile     #     60     - Ala Asn Leu Gln Thr Asp Leu Ser Asp Gly Le - #u Arg Leu Ile Ala Leu     # 80     - Leu Glu Val Leu Ser Gln Lys Lys Met His Ar - #g Lys His Asn Gln Arg     #                 95     - Pro Thr Phe Arg Gln Met Gln Leu Glu Asn Va - #l Ser Val Ala Leu Glu     #           110     - Phe Leu Asp Arg Glu Ser Ile Lys Leu Val Se - #r Ile Asp Ser Lys Ala     #       125     - Ile Val Asp Gly Asn Leu Lys Leu Ile Leu Gl - #y Leu Ile Trp Thr Leu     #   140     - Ile Leu His Tyr Ser Ile Ser Met Pro Met Tr - #p Asp Glu Glu Glu Asp     145                 1 - #50                 1 - #55                 1 -     #60     - Glu Glu Ala Lys Lys Gln Thr Pro Lys Gln Ar - #g Leu Leu Gly Trp Ile     #               175     - Gln Asn Lys Leu Pro Gln Leu Pro Ile Thr As - #n Phe Ser Arg Asp Trp     #           190     - Gln Ser Gly Arg Ala Leu Gly Ala Leu Val As - #p Ser Cys Ala Pro Gly     #       205     - Leu Cys Pro Asp Trp Asp Ser Trp Asp Ala Se - #r Lys Pro Val Thr Asn     #   220     - Ala Arg Glu Ala Met Gln Gln Ala Asp Asp Tr - #p Leu Gly Ile Pro Gln     225                 2 - #30                 2 - #35                 2 -     #40     - Val Ile Thr Pro Glu Glu Ile Val Asp Pro As - #n Val Asp Glu His Ser     #               255     - Val Met Thr Tyr Leu Ser Gln Phe Pro Lys Al - #a Lys Leu Lys Pro Gly     #           270     - Ala Pro Leu Arg Pro Lys Leu Asn Pro Lys Ly - #s Ala Arg Ala Tyr Gly     #       285     - Pro Gly Ile Glu Pro Thr Gly Asn Met Val Ly - #s Lys Arg Ala Glu Phe     #   300     - Thr Val Glu Thr Arg Ser Ala Gly Gln Gly Gl - #u Val Leu Val Tyr Val     305                 3 - #10                 3 - #15                 3 -     #20     - Glu Asp Pro Ala Gly His Gln Glu Glu Ala Ly - #s Val Thr Ala Asn Asn     #               335     - Asp Lys Asn Arg Thr Phe Ser Val Trp Tyr Va - #l Pro Glu Val Thr Gly     #           350     - Thr His Lys Val Thr Val Leu Phe Ala Gly Gl - #n His Ile Ala Lys Ser     #       365     - Pro Phe Glu Val Tyr Val Asp Lys Ser Gln Gl - #y Asp Ala Ser Lys Val     #   380     - Thr Ala Gln Gly Pro Gly Leu Glu Pro Ser Gl - #y Asn Ile Ala Asn Lys     385                 3 - #90                 3 - #95                 4 -     #00     - Thr Thr Tyr Phe Glu Ile Phe Thr Ala Gly Al - #a Gly Thr Gly Glu Val     #               415     - Glu Val Val Ile Gln Asp Pro Met Gly Gln Ly - #s Gly Thr Val Glu Pro     #           430     - Gln Leu Glu Ala Arg Gly Asp Ser Thr Tyr Ar - #g Cys Ser Tyr Gln Pro     #       445     - Thr Met Glu Gly Val His Thr Val His Val Th - #r Phe Ala Gly Val Pro     #   460     - Ile Pro Arg Ser Pro Tyr Thr Val Thr Val Gl - #y Gln Ala Cys Asn Pro     465                 4 - #70                 4 - #75                 4 -     #80     - Ser Ala Cys Arg Ala Val Gly Arg Gly Leu Gl - #n Pro Lys Gly Val Arg     #               495     - Val Lys Glu Thr Ala Asp Phe Lys Val Tyr Th - #r Lys Gly Ala Gly Ser     #           510     - Gly Glu Leu Lys Val Thr Val Lys Gly Pro Ly - #s Gly Glu Glu Arg Val     #       525     - Lys Gln Lys Asp Leu Gly Asp Gly Val Tyr Gl - #y Phe Glu Tyr Tyr Pro     #   540     - Met Val Pro Gly Thr Tyr Ile Val Thr Ile Th - #r Trp Gly Gly Gln Asn     545                 5 - #50                 5 - #55                 5 -     #60     - Ile Gly Arg Ser Pro Phe Glu Val Lys Val Gl - #y Thr Glu Cys Gly Asn     #               575     - Gln Lys Val Arg Ala Trp Gly Pro Gly Leu Gl - #u Gly Gly Val Val Gly     #           590     - Lys Ser Ala Asp Phe Val Val Glu Ala Ile Gl - #y Asp Asp Val Gly Thr     #       605     - Leu Gly Phe Ser Val Glu Gly Pro Ser Gln Al - #a Lys Ile Glu Cys Asp     #   620     - Asp Lys Gly Asp Gly Ser Cys Asp Val Arg Ty - #r Trp Pro Gln Glu Ala     625                 6 - #30                 6 - #35                 6 -     #40     - Gly Glu Tyr Ala Val His Val Leu Cys Asn Se - #r Glu Asp Ile Arg Leu     #               655     - Ser Pro Phe Met Ala Asp Ile Arg Asp Ala Pr - #o Gln Asp Phe His Pro     #           670     - Asp Arg Val Lys Ala Arg Gly Pro Gly Leu Gl - #u Lys Thr Gly Val Ala     #       685     - Val Asn Lys Pro Ala Glu Phe Thr Val Asp Al - #a Lys His Gly Gly Lys     #   700     - Ala Pro Leu Arg Val Gln Val Gln Asp Asn Gl - #u Gly Cys Pro Val Glu     705                 7 - #10                 7 - #15                 7 -     #20     - Ala Leu Val Lys Asp Asn Gly Asn Gly Thr Ty - #r Ser Cys Ser Tyr Val     #               735     - Pro Arg Lys Pro Val Lys His Thr Ala Met Va - #l Ser Trp Gly Gly Val     #           750     - Ser Ile Pro Asn Ser Pro Phe Arg Val Asn Va - #l Gly Ala Gly Ser His     #       765     - Pro Asn Lys Val Lys Val Tyr Gly Pro Gly Va - #l Ala Lys Thr Gly Leu     #   780     - Lys Ala His Glu Pro Thr Tyr Phe Thr Val As - #p Cys Ala Glu Ala Gly     785                 7 - #90                 7 - #95                 8 -     #00     - Gln Gly Asp Val Ser Ile Gly Ile Lys Cys Al - #a Pro Gly Val Val Gly     #               815     - Pro Ala Glu Ala Asp Ile Asp Phe Asp Ile Il - #e Arg Asn Asp Asn Asp     #           830     - Thr Phe Thr Val Lys Tyr Thr Pro Arg Gly Al - #a Gly Ser Tyr Thr Ile     #       845     - Met Val Leu Phe Ala Asp Gln Ala Thr Pro Th - #r Ser Pro Ile Arg Val     #   860     - Lys Val Glu Pro Ser His Asp Ala Ser Lys Va - #l Lys Ala Glu Gly Pro     865                 8 - #70                 8 - #75                 8 -     #80     - Gly Leu Ser Arg Thr Gly Val Glu Leu Gly Ly - #s Pro Thr His Phe Thr     #               895     - Val Asn Ala Lys Ala Ala Gly Lys Gly Lys Le - #u Asp Val Gln Phe Ser     #           910     - Gly Leu Thr Lys Gly Asp Ala Val Arg Asp Va - #l Asp Ile Ile Asp His     #       925     - His Asp Asn Thr Tyr Thr Val Lys Tyr Thr Pr - #o Val Gln Gln Gly Pro     #   940     - Val Gly Val Asn Val Thr Tyr Gly Gly Asp Pr - #o Ile Pro Lys Ser Pro     945                 9 - #50                 9 - #55                 9 -     #60     - Phe Ser Val Ala Val Ser Pro Ser Leu Asp Le - #u Ser Lys Ile Lys Val     #               975     - Ser Gly Leu Gly Glu Lys Val Asp Val Gly Ly - #s Asp Gln Glu Phe Thr     #           990     - Val Lys Ser Lys Gly Ala Gly Gly Gln Gly Ly - #s Val Ala Ser Lys Ile     #      10050     - Val Gly Pro Ser Gly Ala Ala Val Pro Cys Ly - #s Val Glu Pro Gly Leu     #  10205     - Gly Ala Asp Asn Ser Val Val Arg Phe Leu Pr - #o Arg Glu Glu Gly Pro     #               10401030 - #                1035     - Tyr Glu Val Glu Val Thr Tyr Asp Gly Val Pr - #o Val Pro Gly Ser Pro     #              10550     - Phe Pro Leu Glu Ala Val Ala Pro Thr Lys Pr - #o Ser Lys Val Lys Ala     #          10705     - Phe Gly Pro Gly Leu Gln Gly Gly Ser Ala Gl - #y Ser Pro Ala Arg Phe     #      10850     - Thr Ile Asp Thr Lys Gly Ala Gly Thr Gly Gl - #y Leu Gly Leu Thr Val     #  11005     - Glu Gly Pro Cys Glu Ala Gln Leu Glu Cys Le - #u Asp Asn Gly Asp Gly     #               11201110 - #                1115     - Thr Cys Ser Val Ser Tyr Val Pro Thr Glu Pr - #o Gly Asp Tyr Asn Ile     #              11350     - Asn Ile Leu Phe Ala Asp Thr His Ile Pro Gl - #y Ser Pro Phe Lys Ala     #          11505     - His Val Val Pro Cys Phe Asp Ala Ser Lys Va - #l Lys Cys Ser Gly Pro     #      11650     - Gly Leu Glu Arg Ala Thr Ala Gly Glu Val Gl - #y Gln Phe Gln Val Asp     #  11805     - Cys Ser Ser Ala Gly Ser Ala Glu Leu Thr Il - #e Glu Ile Cys Ser Glu     #               12001190 - #                1195     - Ala Gly Leu Pro Ala Glu Val Tyr Ile Gln As - #p His Gly Asp Gly Thr     #              12150     - His Thr Ile Thr Tyr Ile Pro Leu Cys Pro Gl - #y Ala Tyr Thr Val Thr     #          12305     - Ile Lys Tyr Gly Gly Gln Pro Val Pro Asn Ph - #e Pro Ser Lys Leu Gln     #      12450     - Val Glu Pro Ala Val Asp Thr Ser Gly Val Gl - #n Cys Tyr Gly Pro Gly     #  12605     - Ile Glu Gly Gln Gly Val Phe Arg Glu Ala Th - #r Thr Glu Phe Ser Val     #               12801270 - #                1275     - Asp Ala Arg Ala Leu Thr Gln Thr Gly Gly Pr - #o His Val Lys Ala Arg     #              12950     - Val Ala Asn Pro Ser Gly Asn Leu Thr Glu Th - #r Tyr Val Gln Asp Arg     #          13105     - Gly Asp Gly Met Tyr Lys Val Glu Tyr Thr Pr - #o Tyr Glu Glu Gly Leu     #      13250     - His Ser Val Asp Val Thr Tyr Asp Gly Ser Pr - #o Val Pro Ser Ser Pro     #  13405     - Phe Gln Val Pro Val Thr Glu Gly Cys Asp Pr - #o Ser Arg Val Arg Val     #               13601350 - #                1355     - His Gly Pro Gly Ile Gln Ser Gly Thr Thr As - #n Lys Pro Asn Lys Phe     #              13750     - Thr Val Glu Thr Arg Gly Ala Gly Thr Gly Gl - #y Leu Gly Leu Ala Val     #          13905     - Glu Gly Pro Ser Glu Ala Lys Met Ser Cys Me - #t Asp Asn Lys Asp Gly     #      14050     - Ser Cys Ser Val Glu Tyr Ile Pro Tyr Glu Al - #a Gly Thr Tyr Ser Leu     #  14205     - Asn Val Thr Tyr Gly Gly His Gln Val Pro Gl - #y Ser Pro Phe Lys Val     #               14401430 - #                1435     - Pro Val His Asp Val Thr Asp Ala Ser Lys Va - #l Lys Cys Ser Gly Pro     #              14550     - Gly Leu Ser Pro Gly Met Val Arg Ala Asn Le - #u Pro Gln Ser Phe Gln     #          14705     - Val Asp Thr Ser Lys Ala Gly Val Ala Pro Le - #u Gln Val Lys Val Gln     #      14850     - Gly Pro Lys Gly Leu Val Glu Pro Val Asp Va - #l Val Asp Asn Ala Asp     #  15005     - Gly Thr Gln Thr Val Asn Tyr Val Pro Ser Ar - #g Glu Gly Pro Tyr Ser     #               15201510 - #                1515     - Ile Ser Val Leu Tyr Gly Asp Glu Glu Val Pr - #o Arg Ser Pro Phe Lys     #              15350     - Val Lys Val Leu Pro Thr His Asp Ala Ser Ly - #s Val Lys Ala Ser Gly     #          15505     - Pro Gly Leu Asn Thr Thr Gly Val Pro Ala Se - #r Leu Pro Val Glu Phe     #      15650     - Thr Ile Asp Ala Lys Asp Ala Gly Glu Gly Le - #u Leu Ala Val Gln Ile     #  15805     - Thr Asp Pro Glu Gly Lys Pro Lys Lys Thr Hi - #s Ile Gln Asp Asn His     #               16001590 - #                1595     - Asp Gly Thr Tyr Thr Val Ala Tyr Val Pro As - #p Val Thr Gly Arg Tyr     #              16150     - Thr Ile Leu Ile Lys Tyr Gly Gly Asp Glu Il - #e Pro Phe Ser Pro Tyr     #          16305     - Arg Val Arg Ala Val Pro Thr Gly Asp Ala Se - #r Lys Cys Thr Val Thr     #      16450     - Val Ser Ile Gly Gly His Gly Leu Gly Ala Gl - #y Ile Gly Pro Thr Ile     #  16605     - Gln Ile Gly Glu Glu Thr Val Ile Thr Val As - #p Thr Lys Ala Ala Gly     #               16801670 - #                1675     - Lys Gly Lys Val Thr Cys Thr Val Cys Thr Pr - #o Asp Gly Ser Glu Val     #              16950     - Asp Val Asp Val Val Glu Asn Glu Asp Gly Th - #r Phe Asp Ile Phe Tyr     #          17105     - Thr Ala Pro Gln Pro Gly Lys Tyr Val Ile Cy - #s Val Arg Phe Gly Gly     #      17250     - Glu His Val Pro Asn Ser Pro Phe Gln Val Th - #r Ala Leu Ala Gly Asp     #  17405     - Gln Pro Ser Val Gln Pro Pro Leu Arg Ser Gl - #n Gln Leu Ala Pro Gln     #               17601750 - #                1755     - Tyr Thr Tyr Ala Gln Gly Gly Gln Gln Thr Tr - #p Ala Pro Glu Arg Pro     #              17750     - Leu Val Gly Val Asn Gly Leu Asp Val Thr Se - #r Leu Arg Pro Phe Asp     #          17905     - Leu Val Ile Pro Phe Thr Ile Lys Lys Gly Gl - #u Ile Thr Gly Glu Val     #      18050     - Arg Met Pro Ser Gly Lys Val Ala Gln Pro Th - #r Ile Thr Asp Asn Lys     #  18205     - Asp Gly Thr Val Thr Val Arg Tyr Ala Pro Se - #r Glu Ala Gly Leu His     #               18401830 - #                1835     - Glu Met Asp Ile Arg Tyr Asp Asn Met His Il - #e Pro Gly Ser Pro Leu     #              18550     - Gln Phe Tyr Val Asp Tyr Val Asn Cys Gly Hi - #s Val Thr Ala Tyr Gly     #          18705     - Pro Gly Leu Thr His Gly Val Val Asn Lys Pr - #o Ala Thr Phe Thr Val     #      18850     - Asn Thr Lys Asp Ala Gly Glu Gly Gly Leu Se - #r Leu Ala Ile Glu Gly     #  19005     - Pro Ser Lys Ala Glu Ile Ser Cys Thr Asp As - #n Gln Asp Gly Thr Cys     #               19201910 - #                1915     - Ser Val Ser Tyr Leu Pro Val Leu Pro Gly As - #p Tyr Ser Ile Leu Val     #              19350     - Lys Tyr Asn Glu Gln His Val Pro Gly Ser Pr - #o Phe Thr Ala Arg Val     #          19505     - Thr Gly Asp Asp Ser Met Arg Met Ser His Le - #u Lys Val Gly Ser Ala     #      19650     - Ala Asp Ile Pro Ile Asn Ile Ser Glu Thr As - #p Leu Ser Leu Leu Thr     #  19805     - Ala Thr Val Val Pro Pro Ser Gly Arg Glu Gl - #u Pro Cys Leu Leu Lys     #               20001990 - #                1995     - Arg Leu Arg Asn Gly His Val Gly Ile Ser Ph - #e Val Pro Lys Glu Thr     #              20150     - Gly Glu His Leu Val His Val Lys Lys Asn Gl - #y Gln His Val Ala Ser     #          20305     - Ser Pro Ile Pro Val Val Ile Ser Gln Ser Gl - #u Ile Gly Asp Ala Ser     #      20450     - Arg Val Arg Val Ser Gly Gln Gly Leu His Gl - #u Gly His Thr Phe Glu     #  20605     - Pro Ala Glu Phe Ile Ile Asp Thr Arg Asp Al - #a Gly Tyr Gly Gly Leu     #               20802070 - #                2075     - Ser Leu Ser Ile Glu Gly Pro Ser Lys Val As - #p Ile Asn Thr Glu Asp     #              20950     - Leu Glu Asp Gly Thr Cys Arg Val Thr Tyr Cy - #s Pro Thr Glu Pro Gly     #          21105     - Asn Tyr Ile Ile Asn Ile Lys Phe Ala Asp Gl - #n His Val Pro Gly Ser     #      21250     - Pro Phe Ser Val Lys Val Thr Gly Glu Gly Ar - #g Val Lys Glu Ser Ile     #  21405     - Thr Arg Arg Arg Arg Ala Pro Ser Val Ala As - #n Val Gly Ser His Cys     #               21602150 - #                2155     - Asp Leu Ser Leu Lys Ile Pro Glu Ile Ser Il - #e Gln Asp Met Thr Ala     #              21750     - Gln Val Thr Ser Pro Ser Gly Lys Thr His Gl - #u Ala Glu Ile Val Glu     #          21905     - Gly Glu Asn His Thr Tyr Cys Ile Arg Phe Va - #l Pro Ala Glu Met Gly     #      22050     - Thr His Thr Val Ser Val Lys Tyr Lys Gly Gl - #n His Val Pro Gly Ser     #  22205     - Pro Phe Gln Phe Thr Val Gly Pro Leu Gly Gl - #u Gly Gly Ala His Lys     #               22402230 - #                2235     - Val Arg Ala Gly Gly Pro Gly Leu Glu Arg Al - #a Glu Ala Gly Val Pro     #              22550     - Ala Glu Phe Ser Ile Trp Thr Arg Glu Ala Gl - #y Ala Gly Gly Leu Ala     #          22705     - Ile Ala Val Glu Gly Pro Ser Lys Ala Glu Il - #e Ser Phe Glu Asp Arg     #      22850     - Lys Asp Gly Ser Cys Gly Val Ala Tyr Val Va - #l Gln Glu Pro Gly Asp     #  23005     - Tyr Glu Val Ser Val Lys Phe Asn Glu Glu Hi - #s Ile Pro Asp Ser Pro     #               23202310 - #                2315     - Phe Val Val Pro Val Ala Ser Pro Ser Gly As - #p Ala Arg Arg Leu Thr     #              23350     - Val Ser Ser Leu Gln Glu Ser Gly Leu Lys Va - #l Asn Gln Pro Ala Ser     #          23505     - Phe Ala Val Ser Leu Asn Gly Ala Lys Gly Al - #a Ile Asp Ala Lys Val     #      23650     - His Ser Pro Ser Gly Ala Leu Glu Glu Cys Ty - #r Val Thr Glu Ile Asp     #  23805     - Gln Asp Lys Tyr Ala Val Arg Phe Ile Pro Ar - #g Glu Asn Gly Val Tyr     #               24002390 - #                2395     - Leu Ile Asp Val Lys Phe Asn Gly Thr His Il - #e Pro Gly Ser Pro Phe     #              24150     - Lys Ile Arg Val Gly Glu Pro Gly His Gly Gl - #y Asp Pro Gly Leu Val     #          24305     - Ser Ala Tyr Gly Ala Gly Leu Glu Gly Gly Va - #l Thr Gly Asn Pro Ala     #      24450     - Glu Phe Val Val Asn Thr Ser Asn Ala Gly Al - #a Gly Ala Leu Ser Val     #  24605     - Thr Ile Asp Gly Pro Ser Lys Val Lys Met As - #p Cys Gln Glu Cys Pro     #               24802470 - #                2475     - Glu Gly Tyr Arg Val Thr Tyr Thr Pro Met Al - #a Pro Gly Ser Tyr Leu     #              24950     - Ile Ser Ile Lys Tyr Gly Gly Pro Tyr His Il - #e Gly Gly Ser Pro Phe     #          25105     - Lys Ala Lys Val Thr Gly Pro Arg Leu Val Se - #r Asn His Ser Leu His     #      25250     - Glu Thr Ser Ser Val Phe Val Asp Ser Leu Th - #r Lys Ala Thr Cys Ala     #  25405     - Pro Gln His Gly Ala Pro Gly Pro Gly Pro Al - #a Asp Ala Ser Lys Val     #               25602550 - #                2555     - Val Ala Lys Gly Leu Gly Leu Ser Lys Ala Ty - #r Val Gly Gln Lys Ser     #              25750     - Ser Phe Thr Val Asp Cys Ser Lys Ala Gly As - #n Asn Met Leu Leu Val     #          25905     - Gly Val His Gly Pro Arg Thr Pro Cys Glu Gl - #u Ile Leu Val Lys His     #      26050     - Val Gly Ser Arg Leu Tyr Ser Val Ser Tyr Le - #u Leu Lys Asp Lys Gly     #  26205     - Glu Tyr Thr Leu Val Val Lys Trp Gly His Gl - #u His Ile Pro Gly Ser     #               26402630 - #                2635     - Pro Tyr Arg Val Val Val Pro                     2645     - (2) INFORMATION FOR SEQ ID NO:9:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 1125 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 1..1125     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:     - TTC GAG ATG TCT GAC TTC ATC GTG GAC ACA AG - #G GAT GCA GGT TAT GGT       48     Phe Glu Met Ser Asp Phe Ile Val Asp Thr Ar - #g Asp Ala Gly Tyr Gly     #                 15     - GGC ATA TCC TTG GCG GTG GAA GGC CCC AGC AA - #A GTG GAC ATC CAG ACG       96     Gly Ile Ser Leu Ala Val Glu Gly Pro Ser Ly - #s Val Asp Ile Gln Thr     #             30     - GAG GAC CTG GAA GAT GGC ACC TGC AAA GTC TC - #C TAC TTC CCT ACC GTG      144     Glu Asp Leu Glu Asp Gly Thr Cys Lys Val Se - #r Tyr Phe Pro Thr Val     #         45     - CCT GGG GTT TAT ATC GTC TCC ACC AAA TTC GC - #T GAC GAG CAC GTG CCT      192     Pro Gly Val Tyr Ile Val Ser Thr Lys Phe Al - #a Asp Glu His Val Pro     #     60     - GGG AGC CCA TTT ACC GTG AAG ATC AGT GGG GA - #G GGA AGA GTC AAA GAG      240     Gly Ser Pro Phe Thr Val Lys Ile Ser Gly Gl - #u Gly Arg Val Lys Glu     # 80     - AGC ATC ACC CGC ACC AGT CGG GCC CCG TCC GT - #G GCC ACT GTC GGG AGC      288     Ser Ile Thr Arg Thr Ser Arg Ala Pro Ser Va - #l Ala Thr Val Gly Ser     #                 95     - ATT TGT GAC CTG AAC CTC AAA ATC CCA GAA AT - #C AAC AGC AGT GAT ATG      336     Ile Cys Asp Leu Asn Leu Lys Ile Pro Glu Il - #e Asn Ser Ser Asp Met     #           110     - TCG GCC CAC GTC ACC AGC CCC TCT GGC CGT GT - #G ACT GAG GCA GAG ATT      384     Ser Ala His Val Thr Ser Pro Ser Gly Arg Va - #l Thr Glu Ala Glu Ile     #       125     - GTG CCC ATG GGG AAG AAC TCA CAC TGC GTC CG - #G TTT GTG CCC CAG GAG      432     Val Pro Met Gly Lys Asn Ser His Cys Val Ar - #g Phe Val Pro Gln Glu     #   140     - ATG GGC GTG CAC ACG GTC AGC GTC AAG TAC CG - #T GGG CAG CAC GTC ACC      480     Met Gly Val His Thr Val Ser Val Lys Tyr Ar - #g Gly Gln His Val Thr     145                 1 - #50                 1 - #55                 1 -     #60     - GGC AGC CCC TTC CAG TTC ACC GTG GGG GCA CT - #T GGT GAA GGA GGC GCC      528     Gly Ser Pro Phe Gln Phe Thr Val Gly Ala Le - #u Gly Glu Gly Gly Ala     #               175     - CAC AAG GTG CGG GCA GGA GGC CCT GGC CTG GA - #G AGA GGA GAA GCG GGA      576     His Lys Val Arg Ala Gly Gly Pro Gly Leu Gl - #u Arg Gly Glu Ala Gly     #           190     - GTC CCA GCT GAG TTC AGC ATT TGG ACC CGG GA - #A GCA GGC GCT GGA GGC      624     Val Pro Ala Glu Phe Ser Ile Trp Thr Arg Gl - #u Ala Gly Ala Gly Gly     #       205     - CTC TCC ATC GCT GTT GAG GGC CCC AGT AAG GC - #C GAG ATT ACA TTC GAT      672     Leu Ser Ile Ala Val Glu Gly Pro Ser Lys Al - #a Glu Ile Thr Phe Asp     #   220     - GAC CAT AAA AAT GGG TCG TGC GGT GTA TCT TA - #T ATT GCC CAA GAG CCT      720     Asp His Lys Asn Gly Ser Cys Gly Val Ser Ty - #r Ile Ala Gln Glu Pro     225                 2 - #30                 2 - #35                 2 -     #40     - GGT AAC TAC GAG GTG TCC ATC AAG TTC AAT GA - #T GAG CAC ATC CCG GAA      768     Gly Asn Tyr Glu Val Ser Ile Lys Phe Asn As - #p Glu His Ile Pro Glu     #               255     - AGC CCC TAC CTG GTG CCG GTC ATC GCA CCC TC - #C GAC GAC GCC CGC CGC      816     Ser Pro Tyr Leu Val Pro Val Ile Ala Pro Se - #r Asp Asp Ala Arg Arg     #           270     - CTC ACT GTT ATG AGC CTT CAG GAA TCG GGA TT - #A AAA GTT AAC CAG CCA      864     Leu Thr Val Met Ser Leu Gln Glu Ser Gly Le - #u Lys Val Asn Gln Pro     #       285     - GCA TCC TTT GCT ATA AGG TTG AAT GGC GCA AA - #A GGC AAG ATT GAT GCA      912     Ala Ser Phe Ala Ile Arg Leu Asn Gly Ala Ly - #s Gly Lys Ile Asp Ala     #   300     - AAG GTG CAC AGC CCC TCT GGA GCC GTG GAG GA - #G TGC CAC GTG TCT GAG      960     Lys Val His Ser Pro Ser Gly Ala Val Glu Gl - #u Cys His Val Ser Glu     305                 3 - #10                 3 - #15                 3 -     #20     - CTG GAG CCA GAT AAG TAT GCT GTT CGC TTC AT - #C CCT CAT GAG AAT GGT     1008     Leu Glu Pro Asp Lys Tyr Ala Val Arg Phe Il - #e Pro His Glu Asn Gly     #               335     - GTC CAC ACC ATC GAT GTC AAG TTC AAT GGG AG - #C CAC GTG GTT GGA AGC     1056     Val His Thr Ile Asp Val Lys Phe Asn Gly Se - #r His Val Val Gly Ser     #           350     - CCC TTC AAA GTG CGC GTT GGG GAG CCT GGA CA - #A GCG GGG AAC CCT GCC     1104     Pro Phe Lys Val Arg Val Gly Glu Pro Gly Gl - #n Ala Gly Asn Pro Ala     #       365     #                1125 GGC ACG     Leu Val Ser Ala Tyr Gly Thr     #   375     - (2) INFORMATION FOR SEQ ID NO:10:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 375 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:     - Phe Glu Met Ser Asp Phe Ile Val Asp Thr Ar - #g Asp Ala Gly Tyr Gly     #                 15     - Gly Ile Ser Leu Ala Val Glu Gly Pro Ser Ly - #s Val Asp Ile Gln Thr     #             30     - Glu Asp Leu Glu Asp Gly Thr Cys Lys Val Se - #r Tyr Phe Pro Thr Val     #         45     - Pro Gly Val Tyr Ile Val Ser Thr Lys Phe Al - #a Asp Glu His Val Pro     #     60     - Gly Ser Pro Phe Thr Val Lys Ile Ser Gly Gl - #u Gly Arg Val Lys Glu     # 80     - Ser Ile Thr Arg Thr Ser Arg Ala Pro Ser Va - #l Ala Thr Val Gly Ser     #                 95     - Ile Cys Asp Leu Asn Leu Lys Ile Pro Glu Il - #e Asn Ser Ser Asp Met     #           110     - Ser Ala His Val Thr Ser Pro Ser Gly Arg Va - #l Thr Glu Ala Glu Ile     #       125     - Val Pro Met Gly Lys Asn Ser His Cys Val Ar - #g Phe Val Pro Gln Glu     #   140     - Met Gly Val His Thr Val Ser Val Lys Tyr Ar - #g Gly Gln His Val Thr     145                 1 - #50                 1 - #55                 1 -     #60     - Gly Ser Pro Phe Gln Phe Thr Val Gly Ala Le - #u Gly Glu Gly Gly Ala     #               175     - His Lys Val Arg Ala Gly Gly Pro Gly Leu Gl - #u Arg Gly Glu Ala Gly     #           190     - Val Pro Ala Glu Phe Ser Ile Trp Thr Arg Gl - #u Ala Gly Ala Gly Gly     #       205     - Leu Ser Ile Ala Val Glu Gly Pro Ser Lys Al - #a Glu Ile Thr Phe Asp     #   220     - Asp His Lys Asn Gly Ser Cys Gly Val Ser Ty - #r Ile Ala Gln Glu Pro     225                 2 - #30                 2 - #35                 2 -     #40     - Gly Asn Tyr Glu Val Ser Ile Lys Phe Asn As - #p Glu His Ile Pro Glu     #               255     - Ser Pro Tyr Leu Val Pro Val Ile Ala Pro Se - #r Asp Asp Ala Arg Arg     #           270     - Leu Thr Val Met Ser Leu Gln Glu Ser Gly Le - #u Lys Val Asn Gln Pro     #       285     - Ala Ser Phe Ala Ile Arg Leu Asn Gly Ala Ly - #s Gly Lys Ile Asp Ala     #   300     - Lys Val His Ser Pro Ser Gly Ala Val Glu Gl - #u Cys His Val Ser Glu     305                 3 - #10                 3 - #15                 3 -     #20     - Leu Glu Pro Asp Lys Tyr Ala Val Arg Phe Il - #e Pro His Glu Asn Gly     #               335     - Val His Thr Ile Asp Val Lys Phe Asn Gly Se - #r His Val Val Gly Ser     #           350     - Pro Phe Lys Val Arg Val Gly Glu Pro Gly Gl - #n Ala Gly Asn Pro Ala     #       365     - Leu Val Ser Ala Tyr Gly Thr     #   375     - (2) INFORMATION FOR SEQ ID NO:11:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 1494 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 1..1449     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:     - AAA ATC CCA GAA ATC AAC AGC AGT GAT ATG TC - #G GCC CAC GTC ACC AGC       48     Lys Ile Pro Glu Ile Asn Ser Ser Asp Met Se - #r Ala His Val Thr Ser     #                 15     - CCC TCT GGC CGT GTG ACT GAG GCA GAG ATT GT - #G CCC ATG GGG AAG AAC       96     Pro Ser Gly Arg Val Thr Glu Ala Glu Ile Va - #l Pro Met Gly Lys Asn     #             30     - TCA CAC TGC GTC CGG TTT GTG CCC CAG GAG AT - #G GGC GTG CAC ACG GTC      144     Ser His Cys Val Arg Phe Val Pro Gln Glu Me - #t Gly Val His Thr Val     #         45     - AGC GTC AAG TAC CGT GGG CAG CAC GTC ACC GG - #C AGC CCC TTC CAG TTC      192     Ser Val Lys Tyr Arg Gly Gln His Val Thr Gl - #y Ser Pro Phe Gln Phe     #     60     - ACC GTG GGG GCA CTT GGT GAA GGA GGC GCC CA - #C AAG GTG CGG GCA GGA      240     Thr Val Gly Ala Leu Gly Glu Gly Gly Ala Hi - #s Lys Val Arg Ala Gly     # 80     - GGC CCT GGC CTG GAG AGA GGA GAA GCG GGA GT - #C CCA GCT GAG TTC AGC      288     Gly Pro Gly Leu Glu Arg Gly Glu Ala Gly Va - #l Pro Ala Glu Phe Ser     #                 95     - ATT TGG ACC CGG GAA GCA GGC GCT GGA GGC CT - #C TCC ATC GCT GTT GAG      336     Ile Trp Thr Arg Glu Ala Gly Ala Gly Gly Le - #u Ser Ile Ala Val Glu     #           110     - GGC CCC AGT AAG GCC GAG ATT ACA TTC GAT GA - #C CAT AAA AAT GGG TCG      384     Gly Pro Ser Lys Ala Glu Ile Thr Phe Asp As - #p His Lys Asn Gly Ser     #       125     - TGC GGT GTA TCT TAT ATT GCC CAA GAG CCT GG - #T AAC TAC GAG GTG TCC      432     Cys Gly Val Ser Tyr Ile Ala Gln Glu Pro Gl - #y Asn Tyr Glu Val Ser     #   140     - ATC AAG TTC AAT GAT GAG CAC ATC CCG GAA AG - #C CCC TAC CTG GTG CCG      480     Ile Lys Phe Asn Asp Glu His Ile Pro Glu Se - #r Pro Tyr Leu Val Pro     145                 1 - #50                 1 - #55                 1 -     #60     - GTC ATC GCA CCC TCC GAC GAC GCC CGC CGC CT - #C ACT GTT ATG AGC CTT      528     Val Ile Ala Pro Ser Asp Asp Ala Arg Arg Le - #u Thr Val Met Ser Leu     #               175     - CAG GAA TCG GGA TTA AAA GTT AAC CAG CCA GC - #A TCC TTT GCT ATA AGG      576     Gln Glu Ser Gly Leu Lys Val Asn Gln Pro Al - #a Ser Phe Ala Ile Arg     #           190     - TTG AAT GGC GCA AAA GGC AAG ATT GAT GCA AA - #G GTG CAC AGC CCC TCT      624     Leu Asn Gly Ala Lys Gly Lys Ile Asp Ala Ly - #s Val His Ser Pro Ser     #       205     - GGA GCC GTG GAG GAG TGC CAC GTG TCT GAG CT - #G GAG CCA GAT AAG TAT      672     Gly Ala Val Glu Glu Cys His Val Ser Glu Le - #u Glu Pro Asp Lys Tyr     #   220     - GCT GTT CGC TTC ATC CCT CAT GAG AAT GGT GT - #C CAC ACC ATC GAT GTC      720     Ala Val Arg Phe Ile Pro His Glu Asn Gly Va - #l His Thr Ile Asp Val     225                 2 - #30                 2 - #35                 2 -     #40     - AAG TTC AAT GGG AGC CAC GTG GTT GGA AGC CC - #C TTC AAA GTG CGC GTT      768     Lys Phe Asn Gly Ser His Val Val Gly Ser Pr - #o Phe Lys Val Arg Val     #               255     - GGG GAG CCT GGA CAA GCG GGG AAC CCT GCC CT - #G GTG TCC GCC TAT GGC      816     Gly Glu Pro Gly Gln Ala Gly Asn Pro Ala Le - #u Val Ser Ala Tyr Gly     #           270     - ACG GGA CTC GAA GGG GGN ACC ACA GGT ATC CA - #G TCG GAA TTC TTT ATT      864     Thr Gly Leu Glu Gly Xaa Thr Thr Gly Ile Gl - #n Ser Glu Phe Phe Ile     #       285     - AAC ACC ACC CGA GCA GGT CCA GGG ACA TTA TC - #C GTC ACC ATC GAA GGC      912     Asn Thr Thr Arg Ala Gly Pro Gly Thr Leu Se - #r Val Thr Ile Glu Gly     #   300     - CCA TCC AAG GTT AAA ATG GAT TGC CAG GAA AC - #A CCT GAA GGG TAC AAA      960     Pro Ser Lys Val Lys Met Asp Cys Gln Glu Th - #r Pro Glu Gly Tyr Lys     305                 3 - #10                 3 - #15                 3 -     #20     - GTC ATG TAC ACC CCC ATG GCT CCT GGT AAC TA - #C CTG ATC AGT GTC AAA     1008     Val Met Tyr Thr Pro Met Ala Pro Gly Asn Ty - #r Leu Ile Ser Val Lys     #               335     - TAC GGT GGG CCC AAC CAC ATC GTG GGC AGT CC - #C TTC AAG GCC AAG GTG     1056     Tyr Gly Gly Pro Asn His Ile Val Gly Ser Pr - #o Phe Lys Ala Lys Val     #           350     - ACT GGC CAG CGT CTA GTT AGC CCT GGC TCA GC - #C AAC GAG ACC TCA TCC     1104     Thr Gly Gln Arg Leu Val Ser Pro Gly Ser Al - #a Asn Glu Thr Ser Ser     #       365     - ATC CTG GTG GAG TCA GTG ACC AGG TCG TCT AC - #A GAG ACC TGC TAT AGC     1152     Ile Leu Val Glu Ser Val Thr Arg Ser Ser Th - #r Glu Thr Cys Tyr Ser     #   380     - GCC ATT CCC AAG GCA TCC TCG GAC GCC AGC AA - #G GTG ACC TCT AAG GGG     1200     Ala Ile Pro Lys Ala Ser Ser Asp Ala Ser Ly - #s Val Thr Ser Lys Gly     385                 3 - #90                 3 - #95                 4 -     #00     - GCA GGG CTC TCA AAG GCC TTT GTG GGC CAG AA - #G AGT TCC TTC CTG GTG     1248     Ala Gly Leu Ser Lys Ala Phe Val Gly Gln Ly - #s Ser Ser Phe Leu Val     #               415     - GAC TGC AGC AAA GCT GGC TCC AAC ATG CTG CT - #G ATC GGG GTC CAT GGG     1296     Asp Cys Ser Lys Ala Gly Ser Asn Met Leu Le - #u Ile Gly Val His Gly     #           430     - CCC ACC ACC CCC TGC GAG GAG GTC TCC ATG AA - #G CAT GTA GGC AAC CAG     1344     Pro Thr Thr Pro Cys Glu Glu Val Ser Met Ly - #s His Val Gly Asn Gln     #       445     - CAA TAC AAC GTC ACA TAC GTC GTC AAG GAG AG - #G GGC GAT TAT GTG CTG     1392     Gln Tyr Asn Val Thr Tyr Val Val Lys Glu Ar - #g Gly Asp Tyr Val Leu     #   460     - GCT GTG AAG TGG GGG GAG GAA CAC ATC CCT GG - #C AGC CCT TTT CAT GTC     1440     Ala Val Lys Trp Gly Glu Glu His Ile Pro Gl - #y Ser Pro Phe His Val     465                 4 - #70                 4 - #75                 4 -     #80     - ACA GTG CCT TAAAACAGTT TTCTCAAATC CTGGAAAAAA AAAAAAAAA - #A AAAAA     1494     Thr Val Pro     - (2) INFORMATION FOR SEQ ID NO:12:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 483 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:     - Lys Ile Pro Glu Ile Asn Ser Ser Asp Met Se - #r Ala His Val Thr Ser     #                 15     - Pro Ser Gly Arg Val Thr Glu Ala Glu Ile Va - #l Pro Met Gly Lys Asn     #             30     - Ser His Cys Val Arg Phe Val Pro Gln Glu Me - #t Gly Val His Thr Val     #         45     - Ser Val Lys Tyr Arg Gly Gln His Val Thr Gl - #y Ser Pro Phe Gln Phe     #     60     - Thr Val Gly Ala Leu Gly Glu Gly Gly Ala Hi - #s Lys Val Arg Ala Gly     # 80     - Gly Pro Gly Leu Glu Arg Gly Glu Ala Gly Va - #l Pro Ala Glu Phe Ser     #                 95     - Ile Trp Thr Arg Glu Ala Gly Ala Gly Gly Le - #u Ser Ile Ala Val Glu     #           110     - Gly Pro Ser Lys Ala Glu Ile Thr Phe Asp As - #p His Lys Asn Gly Ser     #       125     - Cys Gly Val Ser Tyr Ile Ala Gln Glu Pro Gl - #y Asn Tyr Glu Val Ser     #   140     - Ile Lys Phe Asn Asp Glu His Ile Pro Glu Se - #r Pro Tyr Leu Val Pro     145                 1 - #50                 1 - #55                 1 -     #60     - Val Ile Ala Pro Ser Asp Asp Ala Arg Arg Le - #u Thr Val Met Ser Leu     #               175     - Gln Glu Ser Gly Leu Lys Val Asn Gln Pro Al - #a Ser Phe Ala Ile Arg     #           190     - Leu Asn Gly Ala Lys Gly Lys Ile Asp Ala Ly - #s Val His Ser Pro Ser     #       205     - Gly Ala Val Glu Glu Cys His Val Ser Glu Le - #u Glu Pro Asp Lys Tyr     #   220     - Ala Val Arg Phe Ile Pro His Glu Asn Gly Va - #l His Thr Ile Asp Val     225                 2 - #30                 2 - #35                 2 -     #40     - Lys Phe Asn Gly Ser His Val Val Gly Ser Pr - #o Phe Lys Val Arg Val     #               255     - Gly Glu Pro Gly Gln Ala Gly Asn Pro Ala Le - #u Val Ser Ala Tyr Gly     #           270     - Thr Gly Leu Glu Gly Xaa Thr Thr Gly Ile Gl - #n Ser Glu Phe Phe Ile     #       285     - Asn Thr Thr Arg Ala Gly Pro Gly Thr Leu Se - #r Val Thr Ile Glu Gly     #   300     - Pro Ser Lys Val Lys Met Asp Cys Gln Glu Th - #r Pro Glu Gly Tyr Lys     305                 3 - #10                 3 - #15                 3 -     #20     - Val Met Tyr Thr Pro Met Ala Pro Gly Asn Ty - #r Leu Ile Ser Val Lys     #               335     - Tyr Gly Gly Pro Asn His Ile Val Gly Ser Pr - #o Phe Lys Ala Lys Val     #           350     - Thr Gly Gln Arg Leu Val Ser Pro Gly Ser Al - #a Asn Glu Thr Ser Ser     #       365     - Ile Leu Val Glu Ser Val Thr Arg Ser Ser Th - #r Glu Thr Cys Tyr Ser     #   380     - Ala Ile Pro Lys Ala Ser Ser Asp Ala Ser Ly - #s Val Thr Ser Lys Gly     385                 3 - #90                 3 - #95                 4 -     #00     - Ala Gly Leu Ser Lys Ala Phe Val Gly Gln Ly - #s Ser Ser Phe Leu Val     #               415     - Asp Cys Ser Lys Ala Gly Ser Asn Met Leu Le - #u Ile Gly Val His Gly     #           430     - Pro Thr Thr Pro Cys Glu Glu Val Ser Met Ly - #s His Val Gly Asn Gln     #       445     - Gln Tyr Asn Val Thr Tyr Val Val Lys Glu Ar - #g Gly Asp Tyr Val Leu     #   460     - Ala Val Lys Trp Gly Glu Glu His Ile Pro Gl - #y Ser Pro Phe His Val     465                 4 - #70                 4 - #75                 4 -     #80     - Thr Val Pro     - (2) INFORMATION FOR SEQ ID NO:13:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 31 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:     #          31      SNYK SATTTNRADS T     - (2) INFORMATION FOR SEQ ID NO:14:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 18 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:     #  18              SN     - (2) INFORMATION FOR SEQ ID NO:15:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 22 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:     #                 22NYK SA     - (2) INFORMATION FOR SEQ ID NO:16:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 26 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:     #              26  SNYK SATTTN     - (2) INFORMATION FOR SEQ ID NO:17:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 12 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:     #       12     - (2) INFORMATION FOR SEQ ID NO:18:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 11 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:     #       11     - (2) INFORMATION FOR SEQ ID NO:19:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 33 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:     #         33       TAAG GATTACTGTC CTG     - (2) INFORMATION FOR SEQ ID NO:20:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 33 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:     #         33       TAGG CACTTTTGTA GAG     - (2) INFORMATION FOR SEQ ID NO:21:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 33 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:     #         33       TAAG GATTGATGGT GGT     - (2) INFORMATION FOR SEQ ID NO:22:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 33 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:     #         33       TACT CCTTCTCAAA GCG     - (2) INFORMATION FOR SEQ ID NO:23:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 34 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:     #        34        CTAG CGACTGTATT CCCG     - (2) INFORMATION FOR SEQ ID NO:24:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 30 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:     #           30     NYKS ATTTNRADST     - (2) INFORMATION FOR SEQ ID NO:25:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 30 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:25:     #           30     NYKS ATTTNRADST     - (2) INFORMATION FOR SEQ ID NO:26:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 30 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:     #           30     NYKS ATTTNRADST     - (2) INFORMATION FOR SEQ ID NO:27:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 30 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:27:     #           30     NYKS ATTTNRADST     - (2) INFORMATION FOR SEQ ID NO:28:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 33 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:28:     #         33       GCGA GCCGGTATCC GAG     - (2) INFORMATION FOR SEQ ID NO:29:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 35 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:29:     #       35         TGCA CCGAGAGCCG GTATC     - (2) INFORMATION FOR SEQ ID NO:30:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 33 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:30:     #         33       GCGT CATAGATTTC CAC     - (2) INFORMATION FOR SEQ ID NO:31:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 33 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:31:     #         33       GCGT ATTCCCGGCG GTC     - (2) INFORMATION FOR SEQ ID NO:32:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 30 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:32:     #           30     CCCC CATGGCACCT     - (2) INFORMATION FOR SEQ ID NO:33:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 28 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:33:     #             28   CACC ACAACGCG     - (2) INFORMATION FOR SEQ ID NO:34:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 32 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:34:     #          32      GCTC TTCTGGCCCT AC     - (2) INFORMATION FOR SEQ ID NO:35:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 28 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:35:     #             28   CCCA TGGCTCCT     - (2) INFORMATION FOR SEQ ID NO:36:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 27 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:36:     #             27   CACA AACAGGC     __________________________________________________________________________ 

What is claimed is:
 1. A purified and isolated polynucleotide encoding the β₇ integrin cytoplasmic domain-binding human FLP-1 (Flamin-like binding proteins) amino acid sequence set out in SEQ ID NO:
 2. 2. The polynucleotide of claim 1 which is a DNA molecule.
 3. The polynucleotide of claim 2, wherein tne DNA molecule which is selected from the group consisting of cDNA, genomic DNA, partially chemically synthesized DNA, and wholly chemically synthesized DNA.
 4. A purified and isolated DNA molecule consisting of the human FLP-1 polypeptide encoding sequence set out in SEQ ID:
 1. 5. A purified and isolated DNA molecule encoding a β₇ integrin cytoplasmic domain-binding FLP-1 polypeptide selected from the group consisting of:a) the human DNA sequence set out in SEQ ID NO: 1; b) a DNA molecule which hybridizes under stringent conditions to the noncoding strand of the protein coding portion of (a); and c) a DNA molecule that would hybridize to the DNA of (a) but for the degeneracy of the genetic code.
 6. A DNA expression construct comprising the DNA molecule of claim 4, or
 5. 7. A host cell transformed or transfected with the expression construct of claim
 6. 8. A method for producing a β₇ integrin cytoylasmic domain-binding FLP-1 polypeptide comprising growing the host cell of claim 7 in a suitable medium and isolating a FLP-1 polypeptide from the host cell or the medium of its growth. 